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Summary 


This  work  focuses  on  the  problem  of  adaptive  receiver  design  for  direct  sequence  spread-spectrum 
wireless  communications  systems  with  limited  data  support  dictated  by  the  rapidly  changing  char¬ 
acteristics  of  the  wireless  channel.  Both  linear  and  non-linear  (neural  network)  receiver  structures 
are  considered,  usually  equipped  with  an  antenna  array  (space-time  processing).  A  total  of  18 
publications  have  been  supported  by  this  grant  which  are  listed  at  the  end  of  this  summary.  Our 
developments  are  presented  in  Chapters  1  through  11  and  are  summarized  below. 


In  Chapter  1,  based  on  statistical  conditional  optimization  criteria,  we  develop  an  iterative 
algorithm  that  starts  from  the  matched  filter  (or  constraint  vector)  and  generates  a  sequence  of 
filters  that  converges  to  the  minimum-variance-distortionless-response  (MVDR)  solution  for  any 
positive  definite  input  autocorrelation  matrix.  Computationally,  the  algorithm  is  a  simple  recur¬ 
sive  procedure  that  avoids  explicit  matrix  inversion,  decomposition,  or  diagonalization  operations. 
When  the  input  autocorrelation  matrix  is  replaced  by  a  conventional  sample-average  (positive  def¬ 
inite)  estimate,  the  algorithm  effectively  generates  a  sequence  of  MVDR  filter  estimators:  The  bias 
converges  rapidly  to  zero  and  the  covariance  trace  rises  slowly  and  asymptotically  to  the  covari¬ 
ance  trace  of  the  familiar  sample-matrix-inversion  (SMI)  estimator.  For  short  data  records,  early, 
non-asymptotic,  elements  of  the  generated  sequence  of  estimators  offer  favorable  bias/covariance 
balance  and  are  seen  to  outperform  in  mean-square  estimation  error  constraint-LMS,  RLS-type, 
orthogonal  multistage  decomposition,  as  well  as  plain  and  diagonally  loaded  SMI  estimates. 

The  problem  of  selecting  the  most  successful  (in  some  appropriate  sense)  filter  estimator  in 
the  sequence  for  a  given  data  record  is  also  addressed  and  two  data-driven  selection  criteria  are 
proposed.  The  first  criterion  minimizes  the  cross-validated  sample  average  variance  of  the  filter 
estimator  output.  The  second  criterion  maximizes  the  estimated  J-divergence  of  the  filter  esti¬ 
mator  output  conditional  distributions.  Illustrative  interference  suppression  examples  are  followed 
throughout  this  presentation.  The  material  presented  in  this  chapter  has  been  published  in  [Al] . 


In  Chapter  2  we  investigate  the  user  capacity,  throughput,  and  delay  characteristics  of  a  mo¬ 
bile  slotted  ALOHA  direct-sequence  code-division-multiple-access  (DS-CDMA)  link  with  dedicated 
signatures  under  multipath  fading  and  packet-rate  adaptive  antenna  array  signal  reception.  For  a 
given  system  transmission  bit  rate,  the  packet  size  is  designed  to  be  sufficiently  small  to  conform 
with  the  coherence  time  of  the  channel.  Then,  on  an  individual  packet-by-packet  basis  a  phase- 
ambiguous  spatial-temporal  channel  estimate  is  produced  by  a  blind  (unsupervised)  eigen-subspace 
procedure.  The  space-time  channel  estimate  is  phase  corrected  via  a  few  pilot  packet  mid-amble 
bits  and  used  for  joint  spatial-temporal  multiple-access-interference  (MAI)  suppression  according  to 
the  principles  of  auxiliary-vector  filtering.  Subsequently,  packet  success  probabilities  are  derived  in 
the  presence  or  absence  of  forward  error  correction  (FEC)  and  are  used  to  evaluate  the  throughput 
and  delay  characteristics  of  the  link.  The  material  presented  in  this  chapter  has  appeared  in  [A2]. 
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In  Chapter  3,  we  consider  a  multi-layer  perceptron  neural  network  receiver  architecture  for 
the  recovery  of  the  information  bits  of  a  direct-sequence  code-division-multiple-access  (DS-CDMA) 
user.  We  develop  a  fast  converging  adaptive  training  algorithm  that  minimizes  the  bit  error  rate 
(BER)  at  the  output  of  the  receiver.  The  adaptive  algorithm  has  three  key  features:  (i)  it  incorpo¬ 
rates  the  BER,  i.e.  the  ultimate  performance  evaluation  measure,  directly  into  the  learning  process, 
(ii)  it  utilizes  constraints  that  are  derived  from  the  properties  of  the  optimum  single-user  decision 
boundary  for  AWGN  multiple-access  channels,  and  (in)  it  embeds  importance  sampling  principles 
directly  into  the  receiver  optimization  process.  Simulation  studies  illustrate  the  BER  performance 
of  the  proposed  scheme.  The  material  presented  in  Chapter  3  has  been  published  in  [A3],  [A4] . 


In  Chapter  4,  we  investigate  the  relative  output  signal-to-interference-plus-noise  ratio  (SINR) 
performance  of  two  linear  direct-sequence  code-division- multiple-access  (DS/CDMA)  multiuser  de¬ 
tectors:  the  full  decorrelator  and  the  partial  decorrelator.  We  derive  necessary  and  sufficient 
conditions  on  the  system  parameters  under  which  the  partial  decorrelator  outperforms  the  full 
decorrelator  in  the  output  SINR  sense.  As  a  side  study,  we  consider  a  blind  implementation  of  the 
full  decorrelator  that  is  based  on  eigendecomposition  of  the  interference-plus-noise  autocovariance 
matrix  and  can  be  easily  modified  to  provide  a  partial  decorrelator.  Simulation  studies  illustrate 
the  relative  SINR  and  BER  performance  of  the  full  and  partial  decorrelator  under  perfectly  known 
and  sample-average-estimated  input  statistics.  The  material  presented  in  this  chapter  has  been 
published  in  [A5],  [A6]. 


The  problem  under  consideration  in  Chapter  5  is  the  adaptive  reception  of  a  spread-spectrum 
(SS)  signal  in  the  presence  of  unknown  correlated  SS  interference  and  additive  impulsive  noise.  The 
general  receiver  structure  is  composed  of  an  adaptive  chip-based  non-linear  pre-processor  followed 
by  an  adaptive  linear  post-processor.  We  develop  two  blind  algorithms  for  the  adaptation  of  the 
parameters  of  the  non-linear  pre-processor;  the  first  algorithm  couples  itself  with  the  adaptation  of 
the  linear  post-processor,  while  the  second  algorithm  is  a  decoupled  procedure  that  is  independent 
of  the  linear  post-processor.  The  material  presented  in  this  chapter  has  been  published  in  [A7],  [A8] . 


In  Chapter  6,  we  analyze  the  performance  of  DS-CDMA  single-user  antenna-array  receivers 
that  utilize  either  a  cascade  of  spatial  and  temporal  linear  filters  or  one  joint  space-time  linear 
filter.  We  consider  both  MF-type  and  MMSE/MVDR-type  filters,  and  we  use  pre-detection  signal- 
to-interference- plus-noise-ratio  (SINR)  as  the  figure  of  merit.  Our  goal  is  twofold:  (i)  Quantify  the 
receiver  output  variance  and  receiver  output  SINR  performance  of  disjoint  and  joint  space-time 
configurations  given  ideal  input  statistics;  (ii)  quantify  the  confidence  level  in  a  certain  neighbor¬ 
hood  of  the  optimal  performance  point  of  the  corresponding  estimated  structures  that  are  evaluated 
based  on  limited  sample  support  due  to  a  fast  frequency-nonselective  fading  channel.  These  devel¬ 
opments  have  been  reported  in  [A9]. 


The  prohibitive  -exponential  in  the  number  of  users-  computational  complexity  of  the  maximum 
likelihood  (ML)  multiuser  detector  for  direct-sequence  code-division-multiple-access  (DS/CDMA) 
communications  has  fueled  an  extensive  research  effort  for  the  development  of  low  complexity  mul¬ 
tiuser  detection  alternatives.  In  Chapter  7,  we  show  that  we  can  efficiently  and  effectively  approach 
the  error  rate  performance  of  the  optimum  multiuser  detector  as  follows.  We  utilize  a  multiuser 
zero-forcing  or  minimum-mean-square-error  (MMSE)  linear  filter  as  a  pre-processor  and  we  estab- 
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lish  that  the  output  magnitudes,  when  properly  scaled,  provide  a  reliability  measure  for  each  user 
bit  decision.  Then,  we  prepare  an  ordered  reliability-based  error  search  sequence  of  length  linear 
in  the  number  of  users  that  returns  the  most  likely  user  bit  vector  among  all  visited  options.  Nu¬ 
merical  and  simulation  studies  for  moderately  loaded  systems  that  permit  exact  implementation  of 
the  optimum  detector  indicate  that  the  error  rate  performance  of  the  optimum  and  the  proposed 
detector  are  nearly  indistinguishable  over  the  whole  pre-detection  signal-to- noise  ratio  (SNR)  range 
of  practical  interest.  Similar  studies  for  higher  user  loads  (that  prohibit  comparisons  with  the  op¬ 
timum  detector)  demonstrate  error  rate  performance  gains  of  orders  of  magnitude  in  comparison 
with  straight  decorrelating  or  MMSE  multiuser  detection.  The  material  presented  in  this  chapter 
has  been  published  in  [A10]-[A12], 


In  Chapter  8  we  consider  blind  adaptive  linear  receivers  for  the  demodulation  of  direct-sequence 
code-division-multiple-access  (DS/CDMA)  signals  in  asynchronous  transmissions.  The  proposed 
structures  are  self-synchronized  in  the  sense  that  adaptive  synchronization  and  demodulation  are 
viewed  and  treated  as  an  integrated  receiver  operation.  Two  computationally  efficient  combined 
synchronization/demodulation  schemes  are  proposed,  developed  and  analyzed.  The  first  scheme 
is  based  on  the  principles  of  minimum-variance-distortionless-response  (MVDR)  processing,  while 
the  second  scheme  follows  the  principles  of  auxiliary-vector  filtering  and  exhibits  enhanced  per¬ 
formance  in  short  data  record  scenarios.  In  both  cases  the  resulting  receiver  is  a  linear  structure 
of  order  exactly  equal  to  the  system  processing  gain.  Simulation  studies  included  in  this  chapter 
demonstrate  the  coarse  synchronization  as  well  as  the  bit-error-rate  performance  of  the  proposed 
strategies.  The  material  presented  in  this  chapter  has  appeared  in  [A13]. 


Chapter  9  deals  with  the  theoretical  finite  data-record  performance  analysis  of  the  develop¬ 
ments  in  Chapter  8.  In  this  chapter  we  investigate  the  coarse  synchronization  performance  of 
blind  adaptive  linear  self- synchronized  receivers  for  asynchronous  direct-sequence  code-division- 
multiple-access  communications  under  finite  data  record  adaptation.  Based  on  transformation 
noise  modeling  techniques,  three  alternative  methods  are  developed  leading  to  analytic  expres¬ 
sions  that  approximate  the  probability  of  coarse  synchronization  error  of  matched-filter-type  (MF) 
and  minimum-variance-distortionless-response-type  (MVDR)  receivers.  The  expressions  are  ex¬ 
plicit  functions  of  the  data  record  size  and  the  filter  order  and  reveal  the  effect  of  short-data-record 
sample-matrix-inversion  (SMI)  implementations  on  the  coarse  synchronization  performance.  Be¬ 
sides  their  theoretical  interest,  the  derived  expressions  provide  simple,  highly-accurate  alternatives 
to  computationally  demanding  performance  evaluation  through  simulations.  The  effect  of  the  data 
record  size  on  the  probability  of  coarse  synchronization  error  is  further  quantified  through  the  use  of 
a  receiver  synchronization  resolution  metric.  Numerical  and  simulation  studies  examine  the  accu¬ 
racy  of  the  theoretical  developments  and  show  that  the  derived  expressions  approximate  closely  the 
actual  coarse  synchronization  performance.  The  material  presented  in  this  chapter  has  appeared 
in  [A14], 


In  Chapter  10  we  address  the  problem  of  navigation  data  demodulation  by  an  adaptive  GPS 
receiver  that  utilizes  a  bank  of  single-satellite  linear-tap-delay  filters  and  employs  antenna-array 
reception.  The  presence  of  an  antenna  array  allows  the  receiver  to  operate  in  the  spatial  do¬ 
main  in  addition  to  the  temporal  (code)  domain.  We  investigate  disjoint-domain  as  well  as  joint- 
domain  space-time  GPS  signal  processing  techniques  and  we  consider  design  criteria  of  conventional 
matched-filter  (MF)  type,  minimum- variance-distortionless-response  (MVDR)  type  and  auxiliary- 
vector  (AV)  type.  The  proposed  structures  utilize  filters  that  operate  at  a  fraction  of  the  navigation 
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data  bit  period  (1  msec)  and  are  followed  by  hard-decision  detectors.  Hard  decisions  taken  over 
a  navigation  data  bit  period  are  then  combined  according  to  a  simple  combining  rule  for  further 
bit-error-rate  (BER)  performance  improvements.  Analytic,  numerical  and  simulation  comparisons 
illustrate  the  relative  merits  of  the  investigated  design  alternatives.  The  material  presented  in  this 
chapter  has  been  published  in  [A15],  [A16] . 


Finally  in  Chapter  11,  we  develop  a  new  low  complexity  algorithm  for  decoding  low-density 
parity-check  (LDPC)  codes.  The  developments  are  oriented  specifically  toward  the  low  cost  -yet 
effective-  decoding  of  (high  rate)  finite  geometry  LDPC  codes.  The  decoding  procedure  updates  the 
hard-decision  received  vector  iteratively  in  search  of  a  valid  codeword  in  the  vector  space.  Only  one 
bit  is  changed  in  each  iteration  and  the  bit  selection  criterion  combines  the  number  of  failed  checks 
and  the  reliability  of  the  received  bits.  Prior  knowledge  of  the  signal  amplitude  and  noise  power 
is  not  required.  An  optional  mechanism  to  avoid  infinite  loops  in  the  search  is  also  proposed.  Our 
studies  show  that  the  algorithm  achieves  an  appealing  performance  versus  complexity  trade-off  for 
finite  geometry  LDPC  codes.  The  material  presented  in  this  chapter  has  appeared  in  [A17],  [A18]. 
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Chapter  1 

Short-data-record  Adaptive  Filtering: 
The  Auxiliary- Vector  Algorithm 


1.1  Introduction 

Minimum-variance-distortionless-response  (MVDR)  filtering  refers  to  the  problem  of  identifying  a  linear 
filter  that  minimizes  the  variance  at  its  output,  while  at  the  same  time  the  filter  maintains  a  distortionless 
response  toward  a  specific  input  vector  direction  of  interest.  If  r  is  a  random,  zero  mean  without  loss  of 
generality,  complex  input  vector  of  dimension  L,  r  G  CL,  that  is  processed  by  an  L-tap  filter  w  E  CL ,  then 
the  filter  output  variance  is  wffRw,  where  R  =  E{rrH}  is  the  input  autocorrelation  matrix  ( E{ •}  denotes 
the  statistical  expectation  operation  and  denotes  the  Hermitian  that  is,  transpose  conjugate  of  x).  The 
MVDR  filter  minimizes  wflRw  and  simultaneously  satisfies  wHv  =  1,  or  more  general  wHv  =  p  E  C,  where 
v  G  CL  is  the  signal  vector  direction  to  be  protected.  In  this  set-up,  MVDR  filtering  is  a  standard  linear 
constraint  optimization  problem  and  a  conventional  Lagrange  multipliers  procedure  leads  to  the  well  known 
solution  [1],  [2] 

R_1v 

Wmvdr  =  p\hR- iv  (L1) 

where  p*  denotes  the  conjugate  of  the  desired  response  wHv  =  p.  MVDR  filtering  has  been  used  extensively 
in  unsupervised  signal  processing  applications  where  a  desired  scalar  filter  output  d  E  C  cannot  be  identified 
or  cannot  be  assumed  available  for  each  input  r  E  CL  (for  example  in  radar  and  array  processing  problems 
where  the  constraint  vector  v  is  usually  referred  to  as  the  “target”  or  “look”  direction  of  interest).  We  may 
also  observe  the  close  relationship  between  the  MVDR  filter  and  the  minimum-mean-square-error  (MMSE) 
or  Wiener  filter.  Indeed,  if  the  constraint  vector  v  is  chosen  to  be  the  statistical  cross-correlation  vector 
between  the  desired  output  d  and  the  input  vector  r,  that  is  if  v  =  E{rd*},  then  the  MVDR  and  MMSE 
filters  become  scaled  versions  of  each  other,  cR_1v,  c  £  C.  For  this  reason,  in  the  rest  of  the  chapter  we  will 
use  the  term  “MMSE/MVDR  filter”  to  refer  to  either  filter. 

In  this  work,  first  we  present  an  iterative  algorithm  for  the  calculation  of  the  MMSE/MVDR  vector 
wmmse/mvdr  in  ( 1.1).  The  algorithm  is  a  non-invasive  procedure  where  no  explicit  matrix  inversion/eigen- 
decomposition/diagonalization  is  attempted.  The  MMSE/MVDR  computation  algorithm  creates  a  sequence 
of  filters  w„,  n  =  0, 1,  2, . . .,  that  begins  from  w0  =  ]yjprv  and  converges  to  the  MMSE/MVDR  filter 
(wqo  =  wmmse/Mvdr)-  At  each  step  n  =  1,2,...,  w„  is  given  as  a  simple,  direct  function  of  R,  v,  and  w„_i. 

The  development  of  the  iterative  algorithm  (which  we  call  “the  auxiliary-vector  algorithm”  for  reasons 
that  will  become  apparent  in  the  sequel)  is  founded  solely  on  statistical  signal  processing  principles.  The 
motivation  behind  its  development  is  adaptive  signal  processing  where  the  input  autocorrelation  matrix  R 
is  assumed  unknown  and  it  is  sample-average  estimated  by  a  data  record  of  M  points,  iq,  r2, . . . ,  r m- 
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M 


R-( M )  =  r™r™- 

m= 1 


(1.2) 
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When  R  is  substituted  by  R(M)  in  the  recursively  generated  sequence  of  filters  w„,  n  =  0,1,2,...,  the 
corresponding  filter  estimators  w„(M),  n  =  0, 1,  2, ... ,  offer  the  means  for  effective  control  over  the  filter 
estimator  bias  versus  (co-) variance  trade-off  [3].  Starting  from  the  0- variance,  high- bias  (for  non- white 
inputs)  wo (M)  =  pyp-v  estimate,  we  can  go  all  the  way  up  to  the  unbiased,  yet  high-variance  for  small 
data  record  sizes  M,  w oo{M)  estimate  and  anywhere  in  between,  w„(M),  1  <  n  <  oo.  As  a  result, 
adaptive  filters  from  this  newly  developed  class  can  be  seen  to  outperform  in  expected  norm-square  estimation 
error,  E{\\w(M)  —  wMMSE/MVDR||2},  (constraint-)LMS  [4],  sample-matrix-inversion  (SMI)  [5]  with  or  without 
diagonal  loading  [6],  RLS-type  [7],  [8],  and  orthogonal  multistage  decomposition  [9],  [10]  adaptive  filter 
implementations.  It  is  worth  mentioning  that  the  familiar  trial-and-error  tuning  to  problem  and  data- 
record-size  specifics  of  the  real- valued  LMS  gain  or  RLS  inverse  matrix  initialization  constant  or  SMI  diagonal 
loading  parameter  that  plagues  field  practitioners  is  now  replaced  by  an  integer  choice  of  one  of  the  recursively 
generated  filters. 

The  problem  of  selecting  the  best  (in  some  appropriate  sense)  filter  estimator  in  the  sequence  for  a  given 
data  record  is  addressed  and  two  data-driven  selection  criteria  are  proposed.  The  first  criterion  is  rather 
general  and  is  motivated  by  the  asymptotic  minimum  output  variance  property  of  the  generated  sequence 
of  filter  estimators.  In  particular,  for  a  given  data  record,  we  select  the  filter  estimator  that  has  minimum 
cross- validated  average  output  variance  (energy).  The  second  rule  is  built  specifically  for  binary  antipodal 
(BPSK-type)  communication  signals  and  is  related  to  the  objective  of  achieving  maximum  stochastic  distance 
between  the  two  conditional  distributions  of  the  filter  estimator  output.  Under  this  rule,  we  choose  the 
filter  estimator  in  the  sequence  that  exhibits  maximum  estimated  J-divergence  of  the  conditional  output 
distributions.  We  pursue  and  analyze  both  supervised  and  unsupervised  (blind)  implementations  of  this 
criterion.  Illustrative  case  studies  drawn  from  the  code-division-multiple-access  (CDMA)  communications 
literature  are  followed  throughout  this  work. 

The  rest  of  the  chapter  is  organized  as  follows.  In  Section  1.2  we  present  the  basic  algorithmic  develop¬ 
ment  and  analysis  results.  Filter  estimation  issues  are  discussed  in  Section  1.3.  The  two  data-driven  criteria 
for  the  selection  of  a  filter  estimator  from  the  generated  sequence  are  developed  in  Section  1.4.  In  Section  1.5 
we  examine  the  quality  of  the  proposed  criteria  through  simulations.  A  few  concluding  remarks  are  given  in 
Section  1.6. 


1.2  Algorithmic  Developments  and  Convergence  Analysis 

For  a  given  constraint  vector  vGC1  consider  the  set  of  filters  V  =  {w  6  CL  :  w  =  v  +  u,  u  £  CL  and 
vHu  =  0}.  V  is  the  class  of  all  filters  w  in  CL  that  are  distortionless  in  v,  that  is  wHv  =  p.  In  this  section 
we  develop  an  iterative  algorithm  for  the  computation  of  the  u  component  of  the  MMSE/MVDR  filter. 

Algorithmic  designs  that  focus  on  the  MMSE/MVDR  filter  part  u  that  is  orthogonal  to  the  constraint  vec¬ 
tor,  or  “look” ,  direction  v  have  been  widely  pursued  in  the  array  processing  literature  and  have  been  known  as 
generalized  sidelobe  cancelers  (GSC)  [11]  or  partially  adaptive  beamformers  [12].  Recent  developments  have 
been  influenced  by  Principal  Component  Analysis  reduced-rank  processing  principles  [13].  In  general,  the 
MMSE/MVDR  filter  part  u  (uHv  =  0)  has  been  approximated  by  ulxi  ~  —  B£X(£_i)T(£_!)xpwGxG,  where 
B  is  the  so  called  “blocking  matrix”  that  satisfies  Bffv  =  Ol-i  (B  is  a  full  column-rank  matrix  that  can  be 
derived  by  Gram-Schmidt  orthogonalization  of  an  L  x  L  orthogonal  projection  operator  such  as  I  —  yj^jp, 
where  I  is  the  identity  matrix),  T  is  the  rank  reducing  matrix  with  1  <  p  <  L  —  1  columns  to  be  designed,  and 
wGSC  is  the  MS-optimum  vector  of  weights  of  the  p  columns  of  T  (wGSC  =  |^jp5-[TifBffRBT]_1TifBffRv 
[12]).  In  [14]  and  [15]  the  p  columns  of  T  were  chosen  to  be  the  p  maximum  eigenvalue  eigenvectors  of 
the  blocked  data  autocorrelation  matrix  BffRB.  If,  however,  the  columns  of  T  have  to  be  eigenvectors  of 
BffRB,  then  the  best  way  to  choose  them  in  the  minimum  output  variance  p-rank  approximation  sense  was 
presented  in  [16]:  Select  the  p  eigenvectors  q;  of  BffRB,  with  corresponding  eigenvalues  Xi,  that  maximize 
,  i  =  l, ...  ,p.  This  design  algorithm  has  also  been  known  as  “cross-spectral  metric”  reduced-rank 
processing  [17].  A  different  approach  from  a  different  point  of  view  is  described  in  this  work.  A  conditional 
statistical  optimization  procedure  is  shown  to  offer  the  means  for  exact  computation  of  u  as  the  convergence 
point  of  an  infinite  series  of  the  form  —  P«gn,  pn  £  77+,  g„  £  CL  and  g(/v  =  0,  Vn  =  1,  2, . . .  . 

We  begin  the  algorithmic  developments  from  the  conventional  matched  filter  (MF)  with  desired  response 
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Fig.  1.1:  Block  diagram  representation  of  the  iteratively  generated  sequence  of  filters 
w0,  wi,  w2, .... 


wHv  =  p 

w° =  ri2v>  (f-3) 

\\v\\ 

which  is  MMSE/MVDR  optimum  for  white  CL  vector  inputs  (when  R  =  cr2I,  a  >  0).  We  recall  that, 
w.l.o.g.  and  for  notational  simplicity,  we  assume  throughout  this  presentation  that  the  input  vectors  r  e  CL 
are  zero  mean.  Next,  we  incorporate  in  w0  an  “auxiliary”  vector  component  that  is  orthogonal  to  v  and  we 
form  (Fig.  1.1) 

P* 

wi  =  w0  -  /qgi  =  7j  rfo  v  -  Migi  (1-4) 

llVll 

where  gi  G  CL  —  {o},  Pi  £  C,  and  g^v  =  0.  We  assume  for  a  moment  that  the  orthogonal  auxiliary 
vector  gi  is  arbitrary  but  non- zero  and  fixed  and  we  concentrate  on  the  selection  of  the  scalar  pi .  The  value 
of  pi  that  minimizes  the  variance  of  the  output  of  the  filter  wi  can  be  found  by  direct  differentiation  of 
E{ |w;PF|2}  or  simply  as  the  value  that  minimizes  the  MS  error  between  w^r  =  jy/p-v^r  and  pi gf^r.  This 
is  essentially  a  scalar  version  of  the  GSC  weight  determination  problem  and  we  present  the  solution  in  the 
form  of  a  proposition  [18]: 


Proposition  1.1  The  scalar  pi  that  minimizes  the  variance  at  the  output  of  wi  or  equivalently  minimizes 
the  MS  error  between  w(/r  =  ypp-v^r  and  plg^r  is 


=  gfRw0 

gfRg 


where  R  =  E{rrH }  is  the  input  autocorrelation  matrix. 


□ 


Since  gi  is  set  to  be  orthogonal  to  v,  (1.5)  shows  that  if  the  vector  Rwo  happens  to  be  “on  v”  (that  is 
Rw0  =  V||Jj|7°v  or  equivalently  (I  —  ppp)Rwo  =  0),  then  pi  =  0.  Indeed,  if  Rw0  =  V||Jj|7°  v  then  w0  is 
already  the  MMSE/MVDR  filter.  To  avoid  this  trivial  case  and  continue  with  our  developments,  we  suppose 
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that  Rwo  ^  v  v.  By  inspection,  we  also  observe  that  for  the  MS-optimum  value  of  pi  the  product 
pigi  =  K'),l,i,Wf'  gi  is  independent  of  the  norm  of  gi .  Hence,  so  is  w, .  At  this  point,  we  decide  to  choose  the 

§1  ^gl 

auxiliary  vector  gi  as  the  normalized  vector  that  maximizes  the  magnitude  of  the  cross-correlation  between 
wjf r  =  |rAp-vffr  and  gf^r,  under  the  constraint  that  gf^v  =  0  and  gf^gi  =  1: 


gi  =  arg  max 

g 


E  jw^r  (gHr)*  j|  =  arg  max  |w,fRg| 
subject  to  gHv  =  0  and  gHg  =  1. 


(1.6) 


For  the  sake  of  mathematical  accuracy,  we  note  that  both  the  criterion  function  |w^Rg|  to  be  maximized 
as  well  as  the  orthogonality  constraints  are  phase  invariant.  In  other  words,  if  gi  satisfies  (1.6)  so  does 
gie-7^  for  any  phase  (f>.  Without  loss  of  generality,  to  avoid  any  ambiguity  in  our  presentation  and  to  have  a 
uniquely  defined  auxiliary  vector,  we  seek  the  one  and  only  auxiliary  vector  that  satisfies  (1.6)  and  places  the 
cross-correlation  value  on  the  positive  real  line  (wjf  Rg  >  0).  This  constraint  optimization  problem  was  first 
posed  and  solved  in  [19]  where  the  filter  wi  in  (1.4)  was  used  for  multiple  access  interference  suppression  in 
multipath  CDMA  communication  channels.  Intuitively,  the  maximum  magnitude  cross-correlation  criterion 
as  defined  in  (1.6)  strives  to  identify  the  auxiliary  vector  orthonormal  to  v  that  can  capture  the  most 
interference  present  in  w^r.  The  solution,  derived  through  conventional  Lagrange  multipliers  optimization, 
is  given  below. 


Proposition  1.2  Suppose  that  (I  —  ppp^Rwg  ^  0  (w0  ^  wMMSE/MVDR).  Then,  the  auxiliary  vector 


gi 


Rw(J  — 
Rwo  — 


hRwo 

v||2 


H  Rwq 
v||2 


(1.7) 


maximizes  the  magnitude  of  the  cross-correlation  between  w^r  =  and  g^r,  |w^Rgi|,  subject  to 

the  constraints  g^v  =  0  and  g^gi  =  1  .In  addition,  w^Rgi  is  real  positive  (w^Rgi  >0).  □ 


So  far  we  have  defined  wo  in  (1.3)  and  wi  in  (1.4)  with  gi  and  fi\  given  by  (1.7)  and  (1.5),  respectively. 
The  iterative  algorithm  for  the  generation  of  an  infinite  sequence  of  filters  wo,  wi,W2, ...  is  already  taking 
shape.  Formally,  we  just  need  to  specify  the  inductive  step.  Assuming  that  the  filter  w„  =  p([ypv  —  X)"=1  Tigi 
has  been  identified  for  some  n  >  1  and  w„  ^  wmmse/Mvdr,  we  argue  as  in  Propositions  1  and  2  and  we 
define  w„+i  as  follows: 

wn+i  =  w„  -  pn+1gn+1  (1.8) 


where 


Rw„  - 


v  Rw„ 


gn+l  “ 


v*Rw„ti 
KW,,  II. .112  v 


(1.9) 


is  the  orthonormal  with  respect  to  v  auxiliary  vector  that,  given  w„ 


correlation  magnitude 


E 


{w^r(g^+1r)*} 


th 


Rg 


,n+ 1 


and 


maximizes  conditionally  the  cross- 


/^n+1  — 


g»+iRw» 

g^+lRgn+l 


(1.10) 


is  the  scalar  that  minimizes  the  MS  error  between  w^r  and  /i*+1g^+1r  (minimizes  E{\w^+lv\2}). 

It  is  important  to  note  that,  while  the  generated  auxiliary  vectors  gi,g2,...  are  all  constrained  to  be 
orthogonal  to  v,  orthogonality  among  the  auxiliary  vectors  is  not  imposed  [20],  [21].  This  is  in  sharp  contrast 
to  previous  work  that  involved  filtering  with  up  to  L  —  1  orthogonal  to  each  other  and  to  v  vectors  [22] -[24], 
where  L  is  the  data  input  vector  dimension.  We  observe,  however,  that  successive  auxiliary  vectors  generated 
by  the  above  recursive  conditional  optimization  procedure  ( 1.8)-(  1. 10)  do  come  up  orthogonal:  g,fg,j+i  =  0, 
Vn  =  1,2,3, .. .  (while  g^gm  7^  0,  Vn,  m,  | n  —  m\  7^  1).  For  completeness  purposes,  below  we  present  this 
observation  in  the  form  of  a  Lemma. 
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Auxiliary-Vector  Algorithm 

Input: 

Autocovariance  matrix  R,  constraint  vector  v, 
desired  response  wHv  =  p  . 


Initialization: 


m  ■ 


Iterative  computation: 

For  n=1 , 2,  .  .  .  do 
begin 

9n  :=  (I  -™")Rwn, 
if  gn  =  0  then  EXIT 

|Xn  := 

w n  :=  w n_i  -|xngn 
end 


9n  R  Wn-1 

gnH  R  g„ 


Output: 

Filter  sequence  w0,  w1 ,  w2, 


Fig.  1.2:  The  algorithm  for  the  iterative  generation  of  the  filter  sequence  wq,  wj,  W2, 
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Lemma  1.1  Successive  auxiliary  vectors  generated  through  (1.8)-(1.10)  are  orthogonal:  g^gn+i  =  0,  n  = 
1,2,3,...  .  However,  g^gm^O,  Vn, m,  |n— m|^l.  □ 


The  algorithm  is  summarized  in  Fig.  1.2.  The  conceptual  simplicity  of  the  conditional  statistical  opti¬ 
mization  process  led  to  a  computationally  simple  recursion.  In  Fig.  1.2  we  chose  to  drop  the  unnecessary,  as 
previously  explained,  normalization  of  the  auxiliary  vectors  and  we  also  factorized  their  numerator  to  make 
the  orthogonal  projection  operator  apparent.  Formal  convergence  of  the  filter  sequence  wo,wi,W2,...  to 
the  MMSE/MVDR  filter  p*  V»R-/V  is  established  by  the  following  Theorem.  The  proof  can  be  found  in  [21]. 


Theorem  1.1  Let  R  be  a  Hermitian  positive  definite  matrix.  Consider  the  iterative  algorithm  of  Fig.  1.2. 

(i)  The  generated  sequence  of  auxiliary-vector  weights  {pn},  n  =  1,2,...,  is  real-valued,  positive,  and 
bounded: 

0<^ -  <A*n<v — ,  n=  1,2,...,  (1.11) 

/'max  /'min 

where  Amax  and  Amin  are  the  maximum  and  minimum,  correspondingly,  eigenvalues  of  R. 

(ii)  The  sequence  of  auxiliary  vectors  {g„},  n=  1,2,...,  converges  to  the  0  vector: 

limg„=0.  (1.12) 

n— too 


(iii)  The  sequence  of  auxiliary-vector  filters  {w„} ,  n  =  1,2,...,  converges  to  the  MMSE/MVDR  filter: 


lim  w„ 

n— too 


=  P 


R-1v 
vffR-1v  ’ 


(1.13) 

□ 


We  conclude  this  section  with  an  illustration.  We  draw  a  signal  model  example  from  the  direct-sequence 
code-division-multiple-access  (DS-CDMA)  communications  literature  and  we  assume  a  synchronous  system 
where  the  input  signal  vector  r  6  1ZL  is  given  by 

K 

r  =  ^  \fEkbk sk  +  n.  (1.14) 

k= 1 

In  this  set-up,  K  denotes  the  total  number  of  signals  ( “users” )  present  and  each  signal  is  defined  through  an 
L-dimensional,  normalized,  binary-antipodal  vector  waveform  (or  “user  signature”)  s/,,  k  =  1,2,...,  K.  The 
signature  vector  dimension  L  is  usually  referred  to  as  the  system  “spreading  gain” .  With  respect  to  the  fc-th 
user  signal,  E /  is  the  received  signal  energy  and  bk  6  {  —  1,  +1}  is  the  information  bit  modeled  as  a  random 
variable  with  equally  probable  values  and  assumed  to  be  statistically  independent  from  all  other  user  bits 
bj ,  j  /  k.  Additive  white  Gaussian  noise  contributions  are  accounted  for  by  n  with  autocorrelation  matrix 
E{ nnT}  =  ct2Ilxl  (xt  denotes  the  transpose  of  x).  With  this  notation  and  normalized  user  signatures,  the 
signal-to-noise  ratio  of  the  A:-th  user  signal  is  defined  by  SNR*,  =  101og10  |/rdB,  A:  =  1,  2, ... ,  K. 

MMSE/MVDR  filtering  for  DS-CDMA  type  problems  has  attracted  significant  interest  [25]-[29].  If 
we  wish  to  recover  the  information  bits  of,  say,  user  1,  then  all  other  signals  constitute  multiple-access 
interference  and  the  MMSE/MVDR  filter  is  built  with  constraint  vector  v  =  Si,  desired  response  wTSi  =  1, 
and  autocorrelation  matrix  R  =  Eksksk  +  <72L  We  choose  L  =  32,  K  =  13,  and  we  draw  an  arbitrary 

set  of  signatures  si,  S2,  . . .,  S13.  For  purposes  of  completeness  in  presentation,  the  exact  signature  assignment 
is  given  in  the  Appendix.  We  fix  the  SNR  of  the  user  of  interest  at  SNRi  =  12dB  while  the  “interferers” 
k  =  2, . . . ,  13  are  at  SNR.2_s  =  lOdB,  SNRg-g  =  12dB  and  SNR10-13  =  14dB.  Fig.  1.3  shows  how 
the  sequence  of  filters  wq,  wj,  . . .  generated  by  the  algorithm  in  Fig.  1.2  converges  to  the  MMSE/MVDR 
solution.  The  convergence  is  captured  in  terms  of  the  norm-square  metric  ||w„  —  wMMSE/MVDR||2  as  a  function 
of  the  iteration  step  (index  of  the  AV  filter  in  the  sequence  or  number  of  auxiliary  vectors  used)  n. 


1.3  Filter  Estimation 

Consider  a  constraint  vector  v  and  a  Hermitian  positive  definite  autocorrelation  matrix  R  of  an  input  vector 
r  G  CL .  Assume  that  R  is  in  fact  unknown  and  it  is  sample-average  estimated  from  a  data  record  of  M 


10 


2.5 


Fig.  1.3:  Convergence  of  the  sequence  of  filters  w„,  n  =  0, 1,  2, . . .,  to  the  MMSE/MVDR  solution 
for  the  signal  model  example  in  (1.14). 


points:  R(M)  =  ±  YZ=i  r™rm-  For  Gaussian  inputs,  R (M)  is  a  maximum-likelihood  (ML),  consistent, 
unbiased  estimator  of  R  [3],  [30].  For  a  large  class  of  multivariate  elliptically  contoured  input  distributions 
that  includes  the  Gaussian,  if  M  >  L  then  R (M)  is  positive  definite  (hence  invertible)  with  probability  1 
(w.p.  1)  [31]- [33] .  Then,  Theorem  1.1  in  Section  1.2  shows  that 


w  „(M) 


Woo  (M)  =  p* 


■ 

J 

R  (M) 

> 

1 

R(M) 

1  _1 

1  V 

(1.15) 


where  Wqo(M)  is  the  widely  used  MMSE/MVDR  filter  estimator  known  as  the  sample-matrix- inversion 
(SMI)  filter  [5]. 

The  output  sequence  begins  from  wo  (M)  =  which  is  a  0- variance,  fixed- valued,  estimator  that 

may  be  severely  biased  (wo (M)  =  v  7^  wMmse/mvdr)  unless  R  =  u2I,  for  some  a  >  0.  In  the  latter  trivial 
case,  w o(M)  is  already  the  perfect  MMSE/MVDR  filter.  Otherwise,  the  next  filter  estimator  in  the  sequence, 
wi(M),  has  a  significantly  reduced  bias  due  to  the  optimization  procedure  employed,  at  the  expense  of  non¬ 
zero  estimator  (co-)variance.  As  we  move  up  in  the  sequence  of  filter  estimators  w„(M),  n  =  0,1,2,..., 
the  bias  decreases  rapidly  to  zero1  while  the  variance  rises  slowly  to  the  SMI  (\v00(M))  levels  (cf.  (1.15)). 
To  quantify  these  remarks,  we  plot  in  Fig.  1.4  the  norm-square  bias  \\E  (w„(M)}  —  wMMse/mvdr||2  and 
the  trace  of  the  covariance  matrix  E  j[w„(M)  —  E{w„(Af)}]  [w„ ( M)  —  E  {wn(M)}]H  j  as  a  function  of 
the  iteration  step  n,  for  the  signal  model  example  of  Fig.  1.3  and  data  record  size  M  =  256.  Bias  and 
cov-trace  values  are  calculated  from  100000  independent  filter  estimator  realizations  for  each  iteration  point 
n.  Formal,  theoretical  statistical  analysis  of  the  generated  estimators  w„(M),  n  =  0,1,2,...,  is  beyond 
the  scope  of  this  presentation.  We  do  note,  however,  that  for  multivariate  elliptically  contoured  input 
distributions,  an  analytic  expression  for  the  covariance  matrix  of  the  SMI  estimator  (M)  can  be  found 

XThe  SMI  estimator  is  unbiased  for  multivariate  elliptically  contoured  input  distributions  [33],  [34]:  I?{woo(M)}  = 
WMMSE /MVDR  =  P*  vFfR-'lv  ’ 
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Numerical  Iteration  (number  of  auxiliary  vectors)  n 

Fig.  1.4:  Norm-square  bias  and  covariance  trace  for  the  sequence  of  estimators  w„(M),  n  =  0, 1, . . . 
.  The  signal  model  is  as  in  Fig.  1.3  and  M  =  256. 


in  [33]:  £;{[w00(M)  -F{w00(M)}]  [w ^(M)—  F{w00(M)}]i?}  =  ,^=4^  (R-1  -  • 

Since  under  these  input  distribution  conditions  Woo  (M)  is  unbiased,  the  trace  of  the  covariance  matrix  is 
the  MS  filter  estimation  error.  It  is  important  to  observe  that  the  covariance  matrix  and,  therefore,  the 
MS  filter  estimation  error  depend  on  the  data  record  size  M,  the  filter  length  L,  as  well  as  the  specifics  of 
the  signal  processing  problem  at  hand  (R  and  v).  It  is  also  important  to  note  that  for  the  CDMA  signal 
model  example  in  (1.14)  the  input  is  Gaussian-mixture  distributed.  Therefore,  the  analytic  result  in  [33]  is 
not  directly  applicable  and  can  only  be  thought  of  as  an  approximation  (a  rather  close  approximation  as  we 
concluded  in  our  studies).  In  any  case,  from  the  results  in  Fig.  1.4  for  M  =  256,  we  see  that  the  estimators 
wi (M),  W2 (M),  ...,  up  to  about  W20 (M)  are  particularly  appealing.  In  contrast,  the  estimators  w n(M) 
for  n  >  20  do  not  justify  their  increased  cov-trace  cost  since  they  have  almost  nothing  to  offer  in  terms  of 
further  bias  reduction. 

The  mean-square  estimation  error  expression  E  |||w„(M)  —  wMMSE/MVDR||2  j  captures  the  bias/ variance 

balance  of  the  individual  members  of  the  estimator  sequence  w „(M),  n  =  0, 1,  2, . . .  .  In  Fig.  1.5  we  plot 
the  MS  estimation  error  as  a  function  of  the  iteration  step  n  (index  of  AV  filter  in  the  sequence  or  number 
of  auxiliary  vectors)  for  the  case  study  in  Fig.  1.4,  for  M  =  256  (Part  (a))  and  M  =  2048  (Part  (b)).  As  a 
reference,  we  also  include  the  MS-error  of  the  constraint-LMS  estimator  [4],  [35] 

/  Vv^  \  p* 

WiMs(m)  =  yl  ~  jj^jp  J  [wLMs(m  -  1)  -  //rmr^wLMS(m  -  1)]  +  jj^jpv  >  (1-16) 

m  —  1, . . . ,  M, 


with  wLMS(0)  =  pjp  v  and  some  p  >  0,  and  the  RLS  estimator  [7],  [8]  with  Matrix- Inversion-Lemma-based 
R-1  estimation: 


R  1(m)  =  R  1(m  —  1)  — 


l(m  -  l)i 


V-1) 


L(?n  -  l)r„ 


m  =  1, . . 
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Fig.  1.6:  MS  estimation  error  versus  number  of  auxiliary  vectors  n  and  sample  support  M. 


with  R  1(0)  =  T-j  for  some  eo  >  0-  Theoretically,  it  is  known  that  the  LMS  gain  parameter  fi  >  0  [36]  has 
to  be  less  than  2<Abiocked  ?  where  A^Jked  is  the  maximum  eigenvalue  of  the  “bio eked- data”  autocorrelation 


matrix  (i  )  R  (l  ||v||2 


While  this  is  a  theoretical  upper  bound,  practitioners  are  well  aware 


that  empirical,  data- dependent  “optimization”  or  “tuning”  of  the  LMS  gain  fi  >  0  or  the  RLS  initialization 
parameter  6q  >  0  [37]  is  necessary  to  achieve  acceptable  performance  (in  our  study  we  set  n  =  blocked  and 
eo  =  20,  respectively).  This  data  specific  tuning  frequently  results  in  misleading,  over-optimistic  conclusions 
about  the  short-data- record  performance  of  the  LMS/RLS  algorithms.  In  contrast,  when  the  filter  estimators 
w„  generated  by  the  algorithm  of  Fig.  1.2  are  considered  instead,  tuning  of  the  real-valued  parameters  ^ 
and  eo  is  virtually  replaced  by  an  integer  choice  among  the  first  several  members  of  the  {w„}  sequence. 
Adaptive,  data-dependent  criteria  for  the  selection  of  the  most  appropriate  AV  filter  in  the  sequence  for  a 
given  data  record  are  developed  in  the  next  section.  In  Fig.  1.5(a),  for  M  =  256  all  estimators  w„  from 
n  =  2  up  to  about  n  =  55  outperform  in  MS-error  their  RLS,  LMS,  and  SMI  (wM)  counterparts.  w8  (n  =  8 
auxiliary  vectors)  has  the  least  MS-error  of  all  (best  bias/variance  trade-off).  When  the  data  record  size 
is  increased  to  M  =  2048  (Fig.  1.5(b)),  we  can  afford  more  iterations  (more  auxiliary  vectors)  and  W13 
offers  the  best  bias/variance  trade-off  (lowest  MS-error).  All  filter  estimators  w„  for  n  >  8  outperform  the 
LMS/RLS/SMI  (wM)  estimators.  For  such  large  data  record  sets  (M  =  2048),  the  RLS  and  the  SMI  (wM) 
MS-error  are  almost  identical.  Fig.  1.6  offers  a  3-dimensional  plot  of  the  MS  estimation  error  as  a  function 
of  the  number  of  auxiliary  vectors  n  and  the  sample  support  M.  The  dark  line  that  traces  the  bottom  of  the 
MS  estimation  error  surface  identifies  the  best  number  of  auxiliary  vectors  for  any  given  data  record  size  M. 

An  alternative  bias/ variance  trading  mechanism  through  real- valued  tuning  is  the  diagonally- loaded 
(DL)  SMI  estimator  [6] 


Wdl-smi  (A) 


: 

R  (M)  +  AI 

-1 

V 

R(M)  +  AI 

1  _1 
[  V 

(1.18) 


where  A  >  0  is  the  diagonal  loading  parameter.  We  observe  that  wDL_SMI(A  =  0)  is  the  regular  SMI 
estimator,  while  lim  wdl.SMi(A)  =  /pv  which  is  the  properly  scaled  matched  filter.  In  Fig.  1.7(a)  we 

A— >00 

plot  the  MS  estimation  error  of  the  DL-SMI  estimator  as  a  function  of  the  diagonal  loading  parameter  A 
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Fig.  1.7:  MS  estimation  error  studies  for  (a)  diagonally  loaded  SMI,  (b)  multistage,  and  (c) 
auxiliary- vector  estimators  (M  =  60). 


(M  =  60).  We  identify  the  best  possible  diagonal  loading  value  A  ~  3.45  (at  significant  computational  cost) 
and  in  Fig.  1.7(c)  we  compare  the  best  DL-SMI  estimator  against  the  AV  estimator  sequence  for  which  no 
diagonal  loading  is  performed.  Interestingly,  the  AV  estimators  w„  from  n  =  4  to  7  outperform  in  MS  error 
the  best  possible  DL-SMI  estimator  (A  ~  3.45). 

Finally,  a  finite  set  of  L  filter  estimators  with  varying  bias/covariance  balance  can  be  obtained  through  the 
use  of  the  orthogonal  “multistage”  filter  decomposition  procedure  in  [9],  [10].  It  can  be  shown  theoretically 
that  the  1-stage  filter,  w ;.atage,  0  <  l  <  L  —  1,  is  equivalent  to  the  following  structure.  First,  change  the 
auxiliary- vector  generation  recursion  in  (1.9)  or  Fig.  1.2  to  impose  orthogonality  not  only  with  respect  to 
the  constraint  vector  v  but  also  with  respect  to  all  previously  defined  auxiliary  vectors  yi,  y2, .  ■  • ,  y«— 1.» 
n  <  L  —  1  : 


th 


yn=  i  - 


Rw„_i. 


(1.19) 


Next,  terminate  the  recursion  at  n  =  l,  0  <  l  <  L  —  1,  and  organize  the  l  orthogonal  to  each  other  and  to  v 
vectors  yi, . . . ,  y ;  in  the  form  of  a  blocking  matrix  B £X;  =  [yi,  y2, . . . ,  y;] .  Then, 


where 


W/.  stage 


Bixi<5/xi 


a  =  [B^RB]  1BflRv 

y  F  L  J 


(1.20) 

(1.21) 
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Fig.  1.8:  MS  estimation  error  for  the  best  multistage  and  AV  estimators  over  the  data  support 
range  M  =  ^  =  16  to  M  =  3L  =  96.  The  MS  estimation  error  of  the  A  =  3.45  DL-SMI  estimator 
is  also  included  as  a  reference. 


is  the  MS  vector-optimum  (unconditionally  optimum)  set  of  weights  of  the  vectors  yx,  y2, . . . ,  y/.2  In  the 
context  of  MMSE/MVDR  filter  estimation  from  a  data  record  of  size  M,  wo_atage(-M)  is  the  matched  filter 
and  w (i-i). stage (Af)  is  the  SMI  estimator.  In  Fig.  1.7(b)  we  plot  the  MS  estimation  error  of  w;_stage(M)  as 
a  function  of  l,  0  <  l  <  L  —  1  =  31,  (M  =  60).  We  identify  the  best  multistage  estimator  (1  =  3  stages)  and 
in  Fig.  1.7(c)  we  compare  against  the  AV  estimator  sequence.  We  see  that  all  AV  estimators  w„  from  n  =  3 
to  8  outperform  in  MS-error  the  best  multistage  estimator  (1  =  3  stages).  Finally,  as  a  last  study,  in  Fig.  1.8 
we  plot  the  MS  error  of  the  A  =  3.45  DL-SMI  estimator  together  with  the  MS  error  of  the  best  multistage 
and  AV  estimators  over  the  data  support  range  M  =  ^  =  16  to  M  =  3L  =  96. 

1.4  How  to  Choose  the  Number  of  Auxiliary  Vectors 

In  this  section  we  present  two  data-driven  rules  for  the  selection  of  the  number  of  auxiliary  vectors  n  [39]. 
The  first  rule  selects  the  AV  filter  estimator  with  n  auxiliary  vectors  that  has  minimum  cross-validated 
average  filter  output  energy.  The  second  selection  rule  is  specific  to  BPSK  communications  receivers  that 
employ  a  sign  detector  at  the  output  of  the  linear  auxiliary- vector  filter.  Details  are  given  below. 

2Therefore,  the  multistage  filter  in  [9],  [10]  is  identical  to  the  filter  1wb’  as  it  appears  in  [22]-[24].  The  multistage  decompo¬ 
sition  algorithm  is  a  computationally  efficient  procedure  for  the  calculation  of  this  filter  tailored  to  the  particular  structure  of 
BffRB  (tri-diagonal  matrix).  The  same  computational  savings  can  be  achieved  by  the  general  forward  calculation  algorithm  of 
Liu  and  Van  Veen  [38]  that  returns  all  intermediate  stage  filters  along  the  way,  up  to  the  stage  of  interest  l  (total  computational 
complexity  of  order  0((M  +  Z)Z/2)).  The  AV  algorithm  in  Fig.  1.2  has  computational  complexity  O  ((M  +  n)  L2)  where  n 
is  the  desired  number  of  auxiliary  vectors.  Again,  all  intermediate  AV  filters  are  returned.  Estimators  of  practical  interest 
have  I  <  M  or  n  C  M.  Therefore,  the  complexity  of  all  such  algorithms  is  dominated  by  O  ( ML 2)  which  is  required  for  the 
computation  of  R(M). 
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1.4.1  Cross- Validated  Minimum-Output- Variance  Rule  (CV-MOV) 

Cross-validation  is  a  well-known  statistical  method  [40].  Here,  we  use  cross-validation  to  select  the  filter 
parameter  of  interest  (number  of  auxiliary  vectors  n )  that  minimizes  the  output  variance  which  is  estimated 
based  on  input  observations  that  have  not  been  used  in  the  process  of  building  the  filter  estimator  itself. 

A  particular  case  of  cross-validation  that  we  use  in  this  work  is  the  “leave-one-out”  method.  The  following 
criterion  defines  the  CV-MOV  AV  filter  estimator  selection  process. 

Criterion  1.1  For  a  given  data  record  of  size  M ,  the  cross-validated  minimum- output-variance  AV  filter 
estimator  selection  rule  chooses  the  AV  filter  estimator  wni(M)  that  minimizes  the  cross-validated  sample 
average  output  variance,  i.e. 


n±  —  argmin 

n 


(M\m)  i 


,rrfw. 


( M\m ) 


(1.22) 


where  ( M\m )  identifies  the  AV  filter  estimator  that  is  evaluated  from  the  available  data  record  after  removing 
the  m-th  sample.  □ 


It  may  be  important  to  emphasize  the  need  for  invoking  the  cross-validation  technique  for  the  evaluation 
of  the  sample- average  output  variance.  If  sample-average  evaluation  using  all  data  were  attempted,  then  the 
selection  rule  would  take  the  form  min  |w^(M)R(M)w„(M)| ,  where  R(M)  =  -A  X!m=i  rmrm  and  w„(M) 

is  given  by  the  algorithm  of  Fig.  1.2  with  R (M)  in  place  of  R.  Such  minimization,  however,  would  result 
in  n  =  oo  since  we  know  that  w^(M)R(M)w„(M)  — »  w^](M)R(M)w00(M)  and  the  SMI  estimator 

n— >  oo 

w oo(M)  achieves  minimum  sample-output-variance  w^(M)R(M)w00(M)  (but  not  minimum  true  output 
variance  w^,(M)Rw00(M),  of  course). 


1.4.2  Output  J-divergence  Rule 

For  illustration  purposes  we  reconsider  the  BPSK  CDMA  signal  model  example  in  (1.14).  The  output 
J-divergence  rule  selects  the  AV  filter  estimator  from  the  sequence  of  AV  estimators  that  maximizes  the  J- 
divergence  of  the  Gaussian  approximated  conditional  filter-output  distributions  (conditioned  on  the  transmit¬ 
ted  information  bit  bi  =  +1  or  bi  =  —1).  The  appropriateness  of  such  a  criterion  as  well  as  implementation 
details  are  presented  below. 

For  a  given  AV  filter  estimator  w„(M),  we  denote  by  pm  the  real  part3  of  the  filter  output  with  input 
the  mth  data  vector  rm,  m  =  1, 2, . . . ,  M: 

pm  =Re  [wf  (M) rm]  ( 1.23) 

K 

=  \flf1Re  [&i(m)w^ (M)s|J  +  ^  \f^kRe  [bk(m) w,f (M)sk]  +  Re  [w(f (M) nm] 

k= 2 


where  the  information  bits  bk (itl),  k  =  1,2, ... ,  K,  m  =  1,2,...,  M,  are  assumed  to  be  independent  identically 
distributed  (i.i.d.)  with  equally  probable  outcomes  and  nm  is  a  0-mean  complex  white  Gaussian  random 
vector  with  autocovariance  matrix  a2I.  Then,  the  scalars  pm,  m  =  1,2,...,  M ,  are  i.i.d.  with  common 
distribution  fp(x)  given  by 


fp(x) 


2KV‘2r, 


Hexp<j 


-{*“£*=  iVMtRe  b^-wff  (M)sfej  |  | 


2cr2  I 


.(M)ir 


(1.24) 


where  lg,  \  i  =  1,  2, . . . ,  2K ',  is  the  bit  of  user  k  in  the  it h  bit-combination. 

3  While  the  signal  model  in  (1.14)  is  real- valued,  we  choose  to  carry  out  this  presentation  in  the  more  general  context  of 
complex  input  vectors  and  filters. 
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Conditioned  on  the  transmitted  information  bit  of  the  user  of  interest,  user  1,  the  pdf  of  the  filter  output 
is  a  mixture  of  2A_1  Gaussian  distributions.  However,  for  “effective”  interference  suppressive  filters  we 
can  safely  approximate  the  conditional  output  distribution  by  a  Gaussian  distribution  as  argued  in  [41]  for 
MMSE/MVDR  linear  filtering.  Under  this  approximation,  the  filter  output  conditional  distributions  given 
that  +1  or  —1  is  transmitted  are  fin  ~  Af  [p(n),<j2+N(n)]  and  fo  n  ~  Af  [— p(n),  crj+N(n)\ ,  respectively, 

where  jtt(n)  =  \[E\Re  [w^  and  cr2+N{n)  =  Yk~2  EkRe  [w^(M)s/,]“  +  a2  |jwn(M)||2  is  the  condi¬ 

tional  variance  due  to  multiple-access-interference  and  AWGN  (the  index  “I+N”  denotes  comprehensively 
the  disturbance  contribution).  The  effect  of  the  above  approximation  on  the  performance  of  the  output 
J-divergence  selection  rule  will  be  examined  in  Section  V. 

The  J-divergence  distance  J(/i,n,  /o,n)  between  the  distributions  /l  n(-)  and  /o,n(-)  is  defined  as  the  sum 
of  the  Kullback-Leibler  (K-L)  distances  between  fi  n  and  /o,n 


j(fl,n,  fo,n)  ~  fo,n)  +  -D(/o,n;  fl,n) 


(1.25) 


where  the  K-L  distance  of  f1<n  from  /0,„  is  defined  by  _D(/i,„,  /0,„)  =  f^Zofi,n(x)  log  [42],  Since 

J(/i  „,  /o,n)  is  a  function  of  the  AV  filter-estimator  parameter  n  (number  of  auxiliary  vectors),  in  the  rest 
of  this  chapter  we  will  use  the  notation  J(n)  to  represent  the  J-divergence  distance  between  fi,n(x)  and 

fo,n{x).  For  the  Gaussian  approximated  pdf’s  /i,„  and  /0i„,  we  have  D(/i,„,  /0,„)  =  T>(/o,m  fi,n)  = 
and  the  J-divergence  simplifies  to 


J{n) 


4A*2  M  _ 

ct?+jvW 


(1.26) 


Expression  (1.26)  justifies  our  choice  of  the  output  J-divergence  as  one  of  the  underlying  rules  for  the  selection 
of  the  best  AV  filter  estimator.  We  recall  that  under  the  same  Gaussian  approximation  of  the  conditional 

filter-output  pdf’s  the  filter  output  SINR  can  be  expressed  as  and,  consequently,  the  BER  as  Q( 
where  Q{x)  =  f^°  exp  ^^jdu.  To  this  extent,  maximization  of  the  output  J-divergence  in  (1.26) 
implies  minimization  of  the  BER.  Therefore,  we  propose  to  select  the  estimator  from  the  generated  sequence 
of  AV  filter  estimators  that  exhibits  maximum  estimated  J-divergence. 


1 )  Supervised  Output  J-divergence  Rule 

Exploiting  the  symmetry  of  and  /o,n(-);  we  can  show  in  a  straightforward  manner  that 


J{n)  = 


4  E2{b\Re  [w^(M)r]} 
Var  {biRe  [w^(M)r]} 


Y,  E  {i  Re  [w^(M)r]  \bx  =  i }  Pr(bi  =  i ) 
*  I  i 


Y  Var  {Re  [w^(M)r]  |6j  =  i}  Pr(b\  =  i) 

i=±  1 


(1.27) 


(1.28) 


where  Var{-}  and  Pr{-}  denote  variance  and  probability,  respectively.  Assuming  availability  of  a  pilot 
information  bit  sequence  {bi(m)}^=1,  we  propose  to  estimate  J(n)  by  estimating  statistical  expectations  and 
probabilities  via  sample  averaging  and  frequencies  of  occurrence,  respectively.  We  note  that  although  (1.27) 
and  (1.28)  are  ideally  equivalent  (when  all  statistical  quantities  are  known),  this  is  not  the  case  in  general 
when  estimated  measures  are  considered.  So,  let  {p f , p £ , p^ , . . . , p\Ii }  =  {Re  [wA(M)rm]  :  b\{m)  =  +1, 

m  =  1,2,3...,  M)  and  {p^  ,p^  ,V%  ,  ■  ■  ■  ,Vm2}  =  {Re  [w^(M)rm]  :  bi(m)  =  -1,  m  =  1,2,3 ,  ...,M}  be  the 
sets  of  all  filter  outputs  under  b\{rn)  =  +1  and  bi(m)  =  —1,  respectively  (Mj,  M2  7^  0  and  Mi  +  M2  =  M). 
First,  it  can  be  shown  in  a  straightforward  manner  that  the  estimator  of  the  numerator  of  (1.28)  (that  is, 
the  weighted  average  of  the  sample  average  mean  of  the  set  {Pm}mL  1  and  the  sample  average  mean  of  the 
set  {Pm}m=  1  weighted  by  the  frequency  of  each  set)  is  equivalent  to  the  estimator  of  the  numerator  of  (1.27) 

4/!2(n)  where  /t(n)  =  ^ m=M  +14™  Pm  (jn  fact;  /*(n)  is  the  minimum  variance  unbiased  estimator  of  /i( n )  [43, 
P-  178]). 

Estimators  of  the  denominator  of  (1.27)  and  (1.28)  are  examined  in  the  following  proposition.  The  proof 
is  included  in  the  Appendix. 
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Proposition  1.3  Consider  the  estimator  of  cr)+N(n)  which  is  the  weighted  average  of  the  sample  average 
variance  of  the  set  {ptn}mLi  an d  the  sample  average  variance  of  the  set  1;  weighted  by  the  frequency 

of  occurrence  of  each  set: 


'rl(n)  = 


Mi 


M  i  +  M2 


;2+ 


(n) 


Mo 


Mi  +  M2 


"W 


(1.29) 


where 


°2+(n)  =  -  J2(p+-fi+(n))2, 

1  1  m= 1 

.  M2 

v2~(n)  =  JT  ~T~(n))2, 


T+(n)  = 

m= 1 

_  A  M2 


m=l 


and  {PtoItoLi,  {pm}mii;  Mi  and  M2  are  as  defined  previously.  Consider  also  the  direct  sample  average 
estimator  of  a)+N(n): 

Mi+m2 


\(n)  = 


1 


Mi  +M2 


^  [6i(rn)pm  -/t(n)]2 


(1.30) 


m=  1 


where  pm  is  given  by  (1.23).  The  estimators  crfrn)  and  <?|(n)  exhibit  the  following  properties:  (i)  They  are 
both  biased  and  (ii)  5|(n)  exhibits  smaller  MSE  from  the  true  value  than  a2 (n).  □ 

Utilizing  Proposition  1.3,  estimators  for  the  filter-output  J-divergence  become  readily  available.  The 
following  theorem  identifies  their  relative  merits.  The  proof  is  included  in  the  Appendix. 


Theorem  1.2 


Define  the  two  supervised  estimators  of  the  output  J-divergence  .J,sp  (n)  =  +2 and  Js+n)  = 

4,2  /n) ,  where  the  subscript  “S”  identifies  a  supervised  implementation.  Then,  for  a  given  information  bit 
a2  W 

pilot  sequence  of  size  M  =  Mi  +  M2,  where  Mi  and  M2  are  the  cardinalities  of  the  sets  {bi(m)  =  +1}  and 
{bi(m)  =  —1}  respectively,  both  estimators  are  biased  while  the  MSE  of  Js+n)  is  less  than  the  MSE  of 
Js,i(n),  i.e. 


E- 


?  4/i2(n) 

Js,2(n ) - 2 - 

I+N 


<U_i_  AT  (fr)  \ 


<E 


?  V(n) 

Js,  l(n)  ^2 

I+N 


o-AAr(n) 


(1.31) 


where  Jfr  is  the  true  value  of  J{n)  as  given  by  (1.26). 


Using  the  preferred  estimator  Js,2(n),  the  supervised  implementation  of  the  output  J-divergence  AV 
filter  estimator  selection  rule  takes  the  final  form  given  by  the  following  criterion. 

Criterion  1.2  For  a  given  information  bit  pilot  sequence  of  size  M,  the  supervised  J-divergence  AV  filter 
estimator  selection  rule  chooses  the  estimator  w„2(M)  with  n2  auxiliary  vectors  where 


n2=arg  max- 


;{-Es,2(n)} 


=arg  max< 


~M  Em= 1  bi  (m)Re  [w^ (M)rm 


1 h  Em= 1  [bi(m)Re  [wff  (M) rm]  -  fi(n)X 


(1.32) 


2)  Unsupervised  (Blind)  Output  J-divergence  Rule 

The  blind  implementation  of  the  rule  is  obtained  by  substituting  the  information  bit  61  in  (1.27)  by 
the  detected  bit  bi  =  sgn  [Re  [w^(M)r]]  (output  of  the  sign  detector  that  follows  the  linear  filter).  In 
particular,  using  bi  in  place  of  bi  in  (1.27)  we  obtain  the  following  J-divergence  expression: 


JB(ri) 


4  £2<{ 

b\Re  w^(M)r 

}  AE2{\ 

\Re  [w(f(M)r] 

1} 

V  ar  - 

(£1  Re  (M)  r] 

|  Var  { |  Re  [w^  (M)  r]  | } 
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(1.33) 


where  the  subscript  “B”  identifies  the  blind  version  of  the  J-divergence  function.  The  following  proposition 
provides  the  conditions  under  which  J_b(u)  is  nearly  equal  to  J(n).  The  proof  is  included  in  the  Appendix. 


Proposition  1.4  If  »  1,  be.  the  filter  output  SINR  is  significantly  higher  than  0 dB,  then 

~  J(n).  □ 

To  estimate  Js(n)  from  a  data  record  of  finite  size,  we  substitute  the  statistical  expectations  in  (1.33) 
by  sample  averages.  The  following  criterion  summarizes  the  corresponding  AV  filter  estimator  selection  rule. 


Criterion  1.3  For  a  given  data  record  of  size  M,  the  unsupervised  (blind)  J-divergence  AV  filter  estimator 
selection  rule  chooses  the  estimator  w„3  (M)  with  n%  auxiliary  vectors  where 


ri3=argmax- 


=arg  max< 


:{  Jb  (n)  | 


bE!=i  \Re  [w^(M)rm]| 

l 

[if  Em= 1  \Re  i^n  (M)rm]|2- 

'if  Em=l  \Re  [™n  (Af)rm]  |' 

1 

(1.34) 


□ 


1.5  Simulation  Studies 


We  examine  the  performance  of  the  proposed  short-data-record  AV  filter  estimator  selection  rules  for  a  DS- 
CDMA  system  with  K  users,  spreading  gain  L,  and  multipath  fading  reception  by  a  narrowband  antenna 
array  with  N  elements.  All  elements  experience  identical  fading.  Let  J  denote  the  number  of  chip-interval¬ 
spaced  paths  per  baseband  user  signal.  After  conventional  carrier  demodulation,  chip-matched  filtering  and 
sampling  at  the  chip  rate  over  a  multipath  extended  symbol  interval  of  L  +  J  —  1  chips,  the  L  +  J  —  1  data 
samples  from  the  i-th  antenna  element,  i  =  1, 2, . . . ,  TV,  are  organized  in  the  form  of  a  vector  r„  given  by 

K  j 

rm  =  12  ck,tVEk(bk(m)sk:t  +  bf  (m)s^  +  6+  (m)s^t)ak,t[i\  +  n^>,  (1.35) 

k= 1  t=  1 

m  —  1 ,  ,  M,  i  =  1, . . . ,  TV. 


In  (1.35),  with  respect  to  the  A:-th  user  signal,  Ek  is  the  transmitted  energy,  bk(m),  b(((m),  and  b^(m)  are 
the  present,  the  previous,  and  the  following  transmitted  bits,  respectively,  and  Ck,t  is  the  coefficient  of  the 
A-th  path  of  the  A:-th  user  signal.  The  channel  coefficients  are  modeled  as  independent  zero-mean  complex 
Gaussian  random  variables  that  are  assumed  to  remain  constant  over  the  filter  adaptation  data  record  of 
size  M.  s k,t  represents  the  ( J  —  l)-zero-padded  and  (t  —  l)-right-shifted  version  of  the  signature  of  the  A:-th 
user  Sfc,  sft  is  the  0-filled,  L-left-shifted  version  of  s k,t,  and  s£t  is  the  0-filled,  L-right-shifted  version  of  s k,t- 

Finally,  n„  represents  additive  complex  white  Gaussian  noise  and  akg[i]  denotes  the  i- th  coordinate  of  the 
array  response  vector  a k,t  that  corresponds  to  the  A-th  path  of  the  A:-th  user  signal: 


^},  (1.36) 

where  6k,t  is  the  angle  of  arrival,  A  is  the  carrier  wavelength,  and  q  is  the  inter-element  spacing  (in  our 
studies  we  set  q  =  A/2). 

We  “vectorize”  the  (L  +  J  —  1)  x  N  space-time  received  data  matrix  [rm  ,  r™  ,  •  •  • ,  r^[  to  form  the  joint 
space-time  data  vector  rm,  which  is  a  (L  +  J  —  l)7V-long  column  vector: 

rm  =  Vec{[r£\r£\...,r£\L+J_l)xN}.  (1.37) 


The  joint  space-time  RAKE  filter  for  user  1  is  vx  =  Et,1{rm61(?n)}  =  Veclfv!  j,-, v12,  •••,v1jv]}  (E^D} 
denotes  statistical  expectation  with  respect  to  bi(m)),  and  v-|  =  Y^t= l  ci,*si,tai,t[*]j  *  =  1,2,..., TV.  The 
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Fig.  1.9:  Histogram  of  the  two  differences  n±  —  nopt  and  n 3  —  nopt  where  n\  is  the  CV-MOV  choice 
and  ns  is  the  blind  J-divergence  choice  (M  =  230,  SNRi  =  8dB). 


MMSE/MVDR  filter  is  built  with  constraint  vector  v  =  vl5  desired  response  wHvi  =  1,  and  autocorrelation 
matrix  R  =  E{ rmr^}. 

We  choose  K  =  20,  IV  =  5,  J  =  3  paths  with  independent  zero-mean  complex  Gaussian  fading  coefficients 
of  variance  one  (i.e.,  E{\ck:t\2}  —  1),  and  Gold  signatures  with  processing  gain  L  =  31.  The  total  SNR’s  (over 
the  three  paths)  of  the  19  interferers  are  set  at  SNR2-6  =  6dB,  SNR7_g  =  7dB,  SNRg_i3  =  8dB,  SNRi4_is  = 
9dB,  SNRi6_20  =  lOdB.  The  space-time  product  (filter  length)  equals  (L  +  J  —  i)N  =  (31  +  2)5  =  165.  All 
experimental  results  that  follow  are  averages  over  100  different  channel  realizations  and  10  independent  data 
record  generations  per  channel. 

We  first  examine  the  performance  of  the  AV  filter  estimator  selection  rules  under  the  assumption  that 
no  info-bit  pilot  sequence  is  available.  The  data  record  size  is  set  equal  to  M  =  230  while  the  total  SNR  of 
the  user  of  interest  is  set  at  SNRi  =  8dB.  In  Fig.  1.9,  we  plot  the  empirical  pdf  of  the  differences  (ni  —  nopt) 
and  (713  —  nopt)  where  ni  and  713  denote  selections  according  to  Criterion  1.1  and  1.3,  respectively,  while 
nopt  denotes  the  “genie”  maximum  SINR  optimum  choice  of  the  number  of  auxiliary  vectors.  We  observe 
that  both  criteria  provide  a  reliable  estimate  of  the  “genie”  assisted  optimum  number  of  auxiliary  vectors. 

The  overall  short-data-record  adaptive  filter  performance  is  examined  in  Figs.  10  and  11.  In  Fig.  1.10, 
we  plot  the  BER4  of  the  AV  filter  estimators  w ni(M)  and  w „3(M)  as  a  function  of  the  SNR  of  the  user 
of  interest  for  data  records  of  size  M  =  230.  The  BER  curve  of  the  “genie”  assisted  BER  optimum  filter 
choice  wn  (M)  as  well  as  the  corresponding  curves  of  the  ideal  MMSE/MVDR  filter  wMMSE/MVDR,  the 
SMI  filter  estimator  woc(Af),  the  S-T  RAKE  matched-filter  (MF)  wo (M)  =  vi,  and  the  multistage  filter  [9], 
[10]  with  the  preferred  number  of  stages5  l  =  7  are  also  included  for  comparison  purposes.  We  observe  that 
both  w ni(M)  and  w „3(M)  are  very  close  to  the  “genie”  optimum  AV  filter  estimator  choice  and  outperform 
significantly  the  SMI  filter  estimator,  the  multistage  filter  estimator,  and  the  matched  filter.  We  also  observe 

4The  BER  of  each  filter  under  consideration  is  approximated  by  Q  (VSINRout)  [41],  since  the  computational  complexity  of 
the  BER  expression  for  this  antenna  array  CDMA  system  prohibits  exact  analytic  evaluation. 

5  In  [44]  it  is  argued  that  1  =  7  ( D  =  8  in  the  notation  of  [44])  stages  is  “nearly  optimal  over  a  wide  range  of  loads  and 
SNRs”. 
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Fig.  1.10:  BER  versus  SNR  for  the  user  signal  of  interest  (M  =  230). 


that  for  moderate  to  high  SNR’s  of  the  user  of  interest,  the  J-divergence  selection  rule  is  slightly  superior  to 
the  CV-MOV  selection  rule.  The  opposite  is  true  in  the  low  SNR  range.  This  is  explained  by  the  fact  that 
the  J-divergence  approximation  J(n)  ss  Js(n)  used  in  Proposition  1.4  is  less  accurate  for  low  filter  output 
SINR  values.  On  the  other  hand,  for  high  filter  output  SINR’s  the  discrimination  capability  of  the  CV-MOV 
rule  is  not  as  sharp. 

Finally,  Fig.  1.11  repeats  the  study  of  Fig.  1.10  as  a  function  of  the  data  record  size.  The  SNR  of  the 
user  of  interest  is  fixed  at  8dB. 


1.6  Concluding  Remarks 

In  this  work  we  relied  strictly  on  statistical  conditional  optimization  principles  to  derive  an  iterative  algorithm 
that  starts  from  the  “white- noise  matched  filter”  and  converges  to  the  “MMSE/MVDR  filter”  solution  for  any 
given  positive  definite  input  autocorrelation  matrix.  The  conceptual  simplicity  of  the  employed  conditional 
optimization  criteria  led  to  a  computationally  simple  iteration  step.  We  analyzed  basic  algorithmic  properties 
and  we  established  formal  convergence  to  the  MMSE/MVDR  filter. 

When  the  input  autocorrelation  matrix  is  substituted  by  a  sample- average  (positive  definite)  estimate, 
the  algorithm  generates  a  sequence  of  filter  estimators  that  converges  to  the  familiar  sample-matrix-inversion 
(SMI)  unbiased  estimator.  The  bias  of  the  generated  estimator  sequence  decreases  rapidly  to  zero  while  the 
estimator  covariance  trace  rises  slowly  from  zero  (for  the  initial,  fixed-valued,  matched-filter  estimator) 
to  the  asymptotic  covariance  trace  of  SMI.  Sequences  of  practical  estimators  that  offer  such  exceptional 
control  over  favorable  bias/covariance  balance  points  are  always  a  prime  objective  in  the  estimation  theory 
literature.  Indeed,  for  finite  data  record  sets,  members  of  the  generated  sequence  of  estimators  were  seen 
to  outperform  in  MS  estimation  error  LMS/RLS-type,  SMI  and  diagonally  loaded  SMI,  and  orthogonal 
multistage  decomposition  filter  estimators.  In  addition,  the  troublesome,  data-dependent  tuning  of  the 
real-valued  LMS  learning  gain  parameter,  the  RLS  initialization  parameter  or  the  SMI  diagonal  loading 
parameter  is  replaced  by  an  integer  choice  among  the  first  several  members  of  the  estimator  sequence.  Two 
data-driven  criteria  were  proposed  for  the  identification  of  the  best  AV  filter  estimator  in  the  sequence. 
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Fig.  1.11:  BER  versus  data  record  size  (SNRi  =  8dB). 


The  first  criterion  calls  for  the  minimization  of  the  cross-validated  filter-estimator  output  variance.  The 
second  criterion  calls  for  the  maximization  of  the  J-divergence  of  the  filter-estimator-output  conditional 
distributions.  Simulation  studies  examined  and  compared  the  operational  characteristics  of  the  proposed 
selection  methods.  With  respect  to  the  relative  merits  of  the  minimum  cross-validated  output  variance  and 
the  maximum  output  J-divergence  selection  rules,  we  observed  that  for  moderate  to  high  output  SINR’s  the 
latter  method  appears  superior  to  the  former  (for  high  SINR’s  the  cross-validated  minimum  output  variance 
rule  is  not  as  sharp  in  discrimination  ability).  In  contrast,  in  low  output  SINR  the  J-divergence  method  is 
somewhat  lacking  in  performance  (technically,  the  approximation  in  Proposition  1.4  is  less  accurate  for  near 
OdB  or  lower  output  SINR  values).  As  a  final  general  comment,  the  use  of  a  sufficiently  long  antenna  array 
in  combination  with  an  “effective”  interference  suppressive  filter  can  result  in  high  output  SINR  which  favors 
the  J-divergence  selection  rule,  even  when  the  transmitted  energy  of  the  user  of  interest  is  much  lower  than 
that  of  the  interferers. 

The  “auxiliary-vector”  algorithm  in  Fig.  1.2  together  with  Criteria  1,  2,  and  3  form  a  complete  toolbox 
for  state-of-the-art  estimation  of  MMSE/MVDR  filters.  The  developments  are  of  particular  interest  in 
high-dimensional  adaptive  signal  processing  applications  that  rely  on  data  records  of  limited  size. 

1.7  Appendix 

Signature  Assignment  for  the  DS/CDMA  Example  of  Sections  III  and  IV 
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The  matrix  S32xi3  =  [sis2  •  •  • 

s33]  with  columns 

the 

signature  vectors  si, 

S2,  . 

. .,  s33  is  given  below. 
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Proof  of  Proposition  1.3 

The  quantities  ^  Xlmii  ( Pm  ~  M+(n))  and  jgj  Emli  fc  —  /t_(n))  are  the  ML  estimators  for  the  variance 
of  the  filter  output  conditioned  on  bi  =  +1  and  bi  =  —1,  respectively  [43,  p.  179].  Both  estimators  are 
biased.  (In  fact,  their  unbiased  counterparts  that  have  multiplying  factors  and  instead  of  -d- 

and  ^ ,  respectively,  exhibit  higher  MSE).  The  MSE’s  of  the  estimators  of  interest  a\  (n)  and  5|(n)  are  as 
follows: 


and 


Thus, 
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MSE52(n)  -  MSE5|(„)  = 


°J+ivM 
(Mi  +  M2)2 
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(1.39) 


(1.40) 
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Proof  of  Theorem  1.2 

] u+(n)  and  cr2+(n)  are  independent  random  variables  with  distributions  /t+(n)  ■A/~(Mn),  CTJ+M!(K))  and 

(n)  ~  Xmi-i  [45]  (xmi-i  denotes  the  chi-square  distribution  with  (Mi  —  1)  degrees  of  freedom). 


Mi 


f(") 


;2+ 
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Similarly,  /t“(n)  and  a2~(n)  are  independent  random  variables  with  distributions  /t“(n)  ~  A f  /i(n), 

and  ~  Xm2- i-  Furthermore,  (fi+(n)  ,  a2+(n))  and  (n) ,d2~ (n))  are  mutually  independent 

because  (/t+(n),  a2+(n))  and  (/J-  (n) ,  <r2_  (n))  are  evaluated  from  two  independent  training  sets. 

Since  /I(n)  =  and  (j2(n)  =  ^<r2+(n)  +  ^ a2~  (n ),  /t(n)  and  <j2(n)  are  independent 


random  variables  with  distributions  /i(n)  ~  AT 


rj+jv(n) 

M 


Mi  +  M2  is  the  size  of  the  given  information  bit  pilot  sequence.  Therefore, 
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|af(n)  } 


)  aIld  l(n)  ~  XM-2  where  M  = 

(1.42) 


=  4 


'J+iV 


(n) 


VW 


M 


V(n) 


M 


1  ^/+jv(n)  (M  -  4)  ^l+ivW  M  -  4  M-  4  ’ 
£{•%!  M}  =  16  E{tf  (n)}E  |  1 


(1.43) 


=  16  I  /_i4(n)  +  ^Moj+ivM  ,  3<J/+Jv(n) 


M 


M2 


M2 


CTj+jvW  (M-4)  (M  —  6) ' 


The  inverse  moments  E  j  and  E  j  af  (k)  }  exisl  for  M  >  4  and  M  >  6,  respectively,  and  are  given  by 

[46] 
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Thus,  the  MSE  of  Js,i(n)  is 
MSEi  =  E 

where 
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On  the  other  hand,  /t(n)  and  5|(n)  are  independent  random  variables  with  distributions  /t(n)  ~  AT  I  jtt(u),  7+JV 
and  a2  J^(n)g|(n)  ~  Xm_h  respectively.  Therefore, 
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where  the  first  and  second  inverse  moments  El  j  j  and  El  j  .4^  j  exist  for  M  >  3  and  M  >  5,  respec¬ 
tively,  and  a  and  6  are  given  by  (1.46). 
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From  (1.45)  and  (1.49),  assuming  M  >  6  which  is  the  condition  for  all  inverse  moments  to  exist,  we 
obtain 


MSEi  -  MSE2  = 
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Proof  of  Proposition  1.4 

Define  Y  =  |f?e  [w^(M)r]  |.  The  pdf  of  Y  is 
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where  17 (j/)  is  the  unit  step  function.  The  mean  of  Y  is 

E{Y}  =  j  yfv(y)dy  =  n(n)  +  exp  “  2M)<3 

The  series  expansion  of  Q(x)  is  [47] 
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where  Rn  =  (  — l)n+1l  •  3  •  •  •  (2 n  +  1)  J)°°  ^-42ii+2  exp  (—E)dt  is  the  remainder  which  is  always 
absolute  value)  than  the  first  neglected  term.  If  a^2\n)  >>  1;  (1.52)  can  be  written  as 
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The  variance  of  Y  is 

Yar{Y}  =  £{Y2}  -  E2{Y}  =  ^(n)  +  aj+N(n )  -  £2{Y} 
and  using  (1.54)  we  may  approximate 
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Finally,  from  (1.54)  and  (1.56)  we  obtain 
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Chapter  2 

Capacity,  Throughput,  and  Delay  of 
Slotted  ALOHA  DS-CDMA  Links 
with  Adaptive  Space-Time 
Auxiliary- Vector  Receivers 

2.1  Introduction 

Mobile  communication  systems  are  a  key  development  toward  realizing  a  global  personal  communications 
network  [1],  In  recent  years,  we  saw  significant  interest  and  commercial  activity  in  the  area  of  mobile  networks 
which  employ  a  base  station  transceiver  serving  a  number  of  mobile  terminals  (star  architecture).  Land- 
mobile  satellite  systems  and  cellular  communications  are  manifestations  of  such  networks  that  can  provide  a 
wide  range  of  radio  services  including  interactive  sessions,  file  transfer,  voice,  facsimile,  and  position  finding. 
Past  commercially  available  mobile  communication  systems  supported  information  bit  rates  of  9.6-32  kb/s. 
Effective  mobile  Internet  access  is  one  application  example  that  requires  the  support  of  higher  data  rates 
and  user  capacity.  Third  generation  (3G)  wireless  systems  promise  to  deliver  significantly  improved  data 
rates  and  capacity  [2]. 

Several  3G  wireless  system  blueprints  are  based  on  direct-sequence  code-division-multiple-access  (DS- 
CDMA)  technology.  Two  leading  proposals  are  cdma2000  (an  extension  of  IS-95B)  and  Wideband  CDMA 
(W-CDMA).  Both  depend  on  packet  data  communication  services  between  the  mobile  terminal  and  the  base 
station  [2].  In  the  past,  random  access  packet  switching  has  received  considerable  attention  for  systems  with 
high  peak-to-average  traffic  ratios  [3],  [4].  Most  of  the  early  research  effort  was  directed  toward  ALOHA- 
type  systems  that  operate  under  the  tacit  assumption  that  when  two  or  more  transmissions  occur  at  the 
same  time  all  packets  are  destroyed  due  to  collision  [5],  [6].  Various  modifications  of  the  ALOHA  scheme, 
including  the  time  slotted  version,  have  been  under  investigation.  Application  of  code  division  spread- 
spectrum  concepts  [7]  permits  simultaneous  transmission  and  successful  reception  of  two  or  more  packets. 
When  a  medium-access-control  (MAC)  protocol  keeps  the  number  of  simultaneous  transmissions  below  a 
system  capacity  threshold,  each  transmitting  user  may  be  assigned  a  unique  spreading  code  for  the  duration 
of  the  transmission  [4],  [8]- [13];  this  system  is  called  “slotted  ALOHA  DS-CDMA  with  dedicated  signatures” 
[11],  [12],  [14]  and  is  free  of  collision  errors  and  pertinent  user  identification  problems  [15]- [17]. 

In  many  cases  of  data  transmission,  the  main  measure  of  link  quality  is  the  throughput  (either  packet 
throughput  or  information  throughput)  which  is  directly  related  to  the  operational  physical  layer  packet- 
error-rate  (PER).  Physical  layer  PER  requirements  may  be  determined  by  the  type  of  service  and  its  sensi¬ 
tivity  to  delays.  For  example,  real-time  voice  imposes  stringent  delay  constraints  and  may  require  one-shot 
transmission  with  guaranteed  upper  bound  on  PER  of  about  10-2  [18].  On  the  other  hand,  data  pack¬ 
ets  can  tolerate  reasonable  delays  but  (near)  error- free  final  data  delivery  may  be  expected  [19].  In  this 
context,  a  retransmission  protocol  (ARQ)  can  soften  the  physical  layer  single-transmission  PER  require¬ 
ment  significantly.  Various  works,  for  example  [8],  [20] - [23] ,  have  analyzed  the  performance  of  DS-CDMA 
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transmissions  in  a  random  access  mode  in  terms  of  throughput,  delay  characteristics  and  system  capacity 
improvements.  In  DS-CDMA  links  with  dedicated  signatures,  the  physical  layer  PER  performance  is  lim¬ 
ited  only  by  multiple-access-interference  (MAI)  and  time- varying  channel  characteristics  (such  as  multipath 
propagation  and  fading). 

While  PER  improvements  can  be  achieved  by  employing  forward  error  correction  (FEC)  [24] ,  significant 
underlying  “raw”  bit-error-rate  (BER)  -and,  hence,  PER-  improvements  are  achieved  by  using  adaptive 
antenna  array  (“smart  antenna”)  receiver  techniques.  DS-CDMA  systems  equipped  with  adaptive  antenna 
arrays  offer  the  opportunity  for  jointly  effective  spatial  (array)  and  temporal  (code)  MAI  and  channel  noise 
suppression  [25],  [26].  Adaptive  minimum- mean-square-error  (MMSE)  filtering  algorithms,  such  as  sample- 
matrix- inversion  (SMI)  [27],  least-mean-squares  (LMS)  [28],  and  recursive-least-squares  (RLS)  [29],  [30] 
outperform  significantly  upon  convergence  the  plain  space-time  (S-T)  RAKE  filter.  However,  the  time- 
varying  channel  characteristics  require  fast  adaptive  joint  S-T  optimization  through  small  input  data  sets 
that  can  “catch  up”  with  multipath  fading  communication  channels. 

In  this  direction,  the  Auxiliary- Vector  (AV)  adaptive  filtering  theory  [26],  [31]- [33]  offers  the  opportunity 
for  effective  interference  suppression  with  short  data  records.  The  AV  algorithm  generates  a  sequence  of 
estimators  of  the  MMSE  filter.  No  explicit  or  implicit  matrix  inversion,  decomposition,  or  diagonalization 
operation  is  required.  Most  importantly,  the  early,  non-asymptotic,  elements  of  the  generated  sequence  of 
estimators  offer  favorable  bias/co  variance  balance  and  have  been  seen  to  outperform  LMS,  RLS,  or  SMI  esti¬ 
mates  in  mean-square  filter  estimation  error.  Recently,  blind  criteria  for  the  selection  of  the  most  successful 
AV  filter  estimator  in  the  generated  sequence  were  developed  [34],  [35]. 

In  this  present  work,  we  focus  on  mobile  packet  data  networks  and  we  examine  the  PER,  throughput,  and 
delay  characteristics  of  slotted  ALOHA  DS-CDMA  random  access  links  with  dedicated  signatures  equipped 
with  packet-rate  S-T  adaptive  AV-filter  receivers.  The  communication  channel  is  assumed  to  be  multipath 
Rayleigh  fading  and  the  packet  size  is  chosen  sufficiently  small  to  conform  with  the  channel  coherence  time 
(that  is,  the  channel  realization  is  assumed  to  remain  nearly  constant  during  each  packet  transmission).  For 
the  purposes  of  adaptive  joint  S-T  processing,  a  subspace  S-T  channel  estimation  procedure  is  also  developed 
based  on  the  single-antenna  channel  estimation  method  of  [36]  (appropriately  modified  and  extended  to  S- 
T  CDMA  systems).  Since  second-order  blind  channel  estimation  methods  suffer  from  phase  ambiguity, 
mean-square  optimum  phase  recovery  is  attempted  through  a  short  pilot  bit  sequence  that  is  included  as  a 
mid-amble  [37]  in  each  packet.  As  we  will  observe,  AV  S-T  filtering  results  in  improved  channel  BER,  which 
in  turn  translates  to  higher  packet  success  probability  and  higher  user  capacity  for  a  given  PER  upperbound 
constraint.  Higher  packet  success  probability  ensures  a  significant  improvement  in  terms  of  packet  throughput 
and  delay  profile  for  networks  with  or  without  user  capacity  control.  Additional  performance  improvements 
through  FEC  coding  can  be  pursued  and  FEC-enhanced  performance  studies  are  presented. 

The  rest  of  this  chapter  is  organized  as  follows.  In  Section  2.2  we  describe  the  system  and  data  model 
under  consideration.  In  Section  2.3  we  present  the  AV  filter  estimation  algorithm.  Blind  S-T  channel 
estimation  together  with  supervised  channel-phase  recovery  are  considered  in  Section  2.4.  In  Section  2.5  we 
focus  on  the  packet  data  unit  and  examine  the  overall  packet  system  performance.  A  few  concluding  remarks 
are  drawn  in  Section  2.6. 


2.2  System  Model 

We  consider  a  slotted  ALOHA  DS-CDMA  link  of  a  packet  data  network.  We  assume  the  presence  of 
K  actively  transmitting  mobile  terminals  (users)  with  dedicated  distinct  signatures  and  we  concentrate 
on  mobile-to-base-station  transmissions  (uplink  connection).  The  K  simultaneous  in  time  and  frequency 
user  transmissions  take  place  over  multipath  fading  additive  white  Gaussian  noise  (AWGN)  channels.  The 
received  signal  at  the  base  station  is  collected  by  a  narrowband  antenna  array.  For  illustration  purposes,  we 
consider  uniform  linear  arrays.  Details  and  notation  are  given  below. 

2.2.1  Data-packet  structure 

The  binary  data  of  each  user  are  organized  in  identically  structured  packets  of  size  M  bits.  The  fcth  user 
data  packet, 

(frfc(O),  6fc(l),  •  •  • ,  bk(M  —  1)}  ,  A- =  0,1,...,  A' -1,  (2.1) 
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contains  M  —  Mp  information  bits  and  Mp  pilot  bits  (bits  that  are  known  to  the  receiver).  We  recall  that 
blind  second-order  channel  estimation  methods  return  phase- ambiguous  estimates;  the  Mp  known  bits  will 
be  utilized  for  the  supervised  recovery  of  the  phase  of  such  blind  channel  estimates.  An  example  of  the 
data  packet  structure  is  shown  in  Fig.  2.1  where  the  Mp  pilot  bits  appear  as  a  mid-amble  in  the  transmitted 
packet. 

Without  loss  of  generality,  we  assume  that  each  user  may  transmit  one  data  packet  per  slot  and  the  slot 
duration  is  Ts  seconds.  Therefore,  the  data  packet  size  M  equals  the  number  of  bits  transmitted  by  each 
user  in  one  time  slot  and  if  the  duration  of  each  bit  transmission  is  Tk  seconds,  then  Ts  =  MTb . 

2.2.2  Transmitted  signal 

The  bit  sequence  (4(0),  4(1), . . . ,  bk(M  —  1)}  of  the  fcth  user  data  packet,  k  =  0, 1, . . . ,  K  —  1,  is  modulated 
by  the  binary  signature  assigned  to  that  user 

dk  =  [4(0),  4(1),...,  4  (A  -  l)]T  (2.2) 

where  4(0  £  {±1}  ,  Z  =  0, 1, ... ,  L  —  1,  and  (-)T  denotes  the  transpose  operation.  The  signature  codelength  L 
is  also  known  as  the  “processing  gain”  of  the  DS-CDMA  system.  Each  active  (transmitting)  user  is  assigned  a 
unique  dedicated  signature  that  remains  fixed  for  the  duration  of  the  transmission.  If  signature  assignment  is 
administered  directly  by  the  base  station,  then  the  signature  itself  can  be  used  for  user  identification  purposes. 
If  instead  each  user  draws  its  own  dedicated  signature  from  the  available  pool  without  further  book-keeping, 
then  user  identification  information  needs  to  be  included  in  the  M  —  Mp  information  bits  of  every  packet.1 
In  either  case,  upon  completion  of  the  transmission  the  signature  is  released  back  to  the  system.  Adopting  a 
conventional  approach,  binary  Gold  user  signatures  are  considered  in  this  work.  Recently,  quasi-stationary 
(QS)  sequences  derived  from  Gold  sequences  were  seen  to  be  well  suited  for  CDMA  slotted  ALOHA  systems 

[38]. 

The  contribution  of  the  kth  user  packet  to  the  transmitted  radio  signal  is  denoted  by 

M—l 

U*(t)  =  bk(i)VE~ksk(t  -  (2.3) 

i= 0 

The  sinusoidal  carrier  frequency  has  the  same  nominal  value  fc  for  all  users,  while  each  transmitter  has  a 
time  invariant  carrier  phase  shift  Sk(t)  is  the  user  signature  waveform  given  by 

L- 1 

Sk(t)  =  ^2  dk{l)ip(t  -  ITC )  (2.4) 

1=0 

where  ip(t)  is  the  time-limited  chip  waveform  with  duration  (period)  Tc  =  The  signature  waveform  is 

assumed  to  be  normalized  to  unit  energy  per  bit  period  (/0Tf>  sk(t)dt  =  1).  Therefore,  Ek  in  (2.3)  represents 
transmitted  energy  per  bit  for  user  k. 

2.2.3  Channel  model  and  received  signal 

The  transmitted  signal  uk(t)  of  each  user  k,  k  =  0, 1, ... . ,  K  —  1,  propagates  over  a  multipath  fading  AWGN 
channel.  All  user  transmitters  (as  well  as  the  base  station  receiver)  are  assumed  to  be  synchronized  to  the 
universal  slot  clock.  A  guard  interval  of  length  several  chip  periods  can  be  “inserted”  between  slots  to  avoid 
slot  spillover.  After  multipath  fading  channel  “processing,”  the  total  received  signal  due  to  all  K  users  is 

K- 1  N-l 

r(t)  =  X]  ak,nUk(t  ~  +  n(t )  (2.5) 

k= 0  n=  0 

where  n(t )  is  a  zero-mean  complex  white  Gaussian  noise  process  with  power  spectral  density  4)1  and  N  is  the 
number  of  resolvable  multipaths  encountered  by  the  packets  (w.l.o.g.  the  number  of  resolvable  multipaths 

1In  DS-CDMA  packet  data  systems  with  random  (non-dedicated)  signature  selection,  the  user  identification  issue 
becomes  significantly  more  complicated  due  to  the  potential  collisions  [15]-[17]. 
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N  is  assumed  to  be  the  same  for  all  user  transmissions).  With  r(t)  bandlimited  to  B  =  dr,  we  can  assume 
a  tapped-delay  line  channel  model  with  taps  spaced  at  chip  intervals  Tc  [39].  The  path  coefficients  ctk,n, 
k  =  0, 1, . . . ,  K  —  1,  n  =  0,1,...,  TV  —  1,  are  independent  zero-mean  complex  Gaussian  random  variables 
(Rayleigh  distributed  amplitude  and  uniformly  distributed  phase)  that  model  the  fading  phenomena  and  are 
assumed  to  remain  constant  over  the  entire  packet  duration.  For  example  [26],  the  path  coefficients  for  a 
typical  vehicle  mobile  transmitting  data  at  19.5  kb/s  on  a  carrier  fc  =  900MHz  remain  practically  unchanged 
for  about  280  bit  periods  (channel  fading  rate  70Hz).  Thus,  keeping  the  packet  size  M  <  280  validates  the 
quasi-static  packet  assumption. 

The  base  station  receiver  is  equipped  with  a  narrowband  linear  antenna  array  of  Z  elements.  The  Z  x  1 
array  response  vector  for  the  nth  path  of  the  A:th  user  signal  is  defined  by 

ak,n(z)  =  ej2^r>i™ek,-  ,  2=1,2 (2.6) 

where  6k, n  identifies  the  angle  of  arrival  of  the  corresponding  path,  A  is  the  carrier  wavelength,  and  d  is  the 
element  spacing  of  the  antenna  array  (usually  d  =  ^ ) .  The  received  Z  x  1  vector  at  the  antenna  array  after 
carrier  demodulation  is  given  by 


K-1M-1  N-l 

r (t)  =  ^  X]  X!  ck,nVEkSk(t  -  iTb  -  nTc) ak,n  +  n (t)  (2.7) 

k= 0  i=  0  n= 0 


where  Ck,n  =  .  Chip-matched  filtering  and  sampling  at  the  chip  rate  /r  (or  low-pass 

filtering,  Nyquist  sampling,  and  chip-rate  accumulation)  over  the  multipath  extended  L  +  N  —  1  chip  period 
prepares  the  data  for  one-shot  detection  of  the  ith  information  bit  of  interest,  say  bo(i).  We  visualize  the 
S-T  data  in  the  form  of  a  Z  x  (L  +  N  —  1)  matrix: 

Xzx(L+N- 1)  =  [r (0) ,  r (Tc), . . . ,  r((L  +  N-  2 )TC)] .  (2.8) 

To  avoid  cumbersome  two-dimensional  filtering  operations  and  notation,  we  “vectorize”  ~X-zx(l+n- i)  by 
sequencing  all  matrix  columns  in  the  form  of  a  single  vector: 

xz(i+iv-i)xi  =  Vec  {XZx(i+Jf_i)}  .  (2.9) 

From  now  on,  x  denotes  the  joint  S-T  data  in  the  <^Z(L+N~1)  complex  vector  domain. 

The  desired  information  bit  of  the  user  of  interest  is  detected  by 

b0  =  sgn  (Re  {wffx})  (2-10) 

for  any  generic  S-T  filter  vector  w,  where  sgn(-)  is  the  ±1  hard-limiter,  Re{-}  extracts  the  real  part  of 
a  complex  number,  and  (-)H  denotes  the  Hermitian  operation.  We  are  particularly  interested  in  the  joint 
S-T  MMSE  optimum  filter  wMMSE.  Section  2.3  is  devoted  to  the  development  of  short-data- record  (packet 
length)  estimators  of  the  joint  S-T  MMSE  filter. 


2.3  Auxiliary- Vector  Adaptive  Filtering 

The  foundation  of  joint  S-T  MMSE  filtering  is  the  S-T  RAKE  filter  (well  known  for  time-only  processing 
[40])  that  we  define  compactly  as  the  cross-correlation  between  the  received  S-T  data  x  and  the  desired 
information  bit  bo  : 

v0  =  Ebo  {x60}  (2.11) 

where  the  statistical  expectation  operation  Ebo  {•}  is  taken  with  respect  to  bo  only.  It  is  straightforward  to 
obtain  the  MMSE  (a.k.a.  Wiener)  filter  solution  [41]  for  our  problem  if  we  view  the  sequence  of  information 
bits  of  the  user  of  interest  as  the  “desired”  filter  output  when  the  filter  input  is  the  S-T  data  vectors  in  (2.9). 
The  solution  is  of  the  form 

Wmmse  =  cR  xvq  (2-12) 
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where  c  is  a  positive  scalar,  vo  is  the  joint  S-T  RAKE  filter  previously  defined,  and  R  =  E  {xxff }  is  the 
S-T  input  autocorrelation  matrix.  If  we  choose  c  =  vhr1_iVo  i  then  we  obtain  the  well  known  equivalent  (cf. 
(10))  minimum- variance-distortionless-response  (MVDR)  filter 


WMVDR 


R 

v^R-^o 


(2.13) 


for  which  w^VDRv0  =  1. 

The  MMSE/MVDR  filter  is  a  function  of  the  true  S-T  input  autocorrelation  matrix  R  and  the  true 
S-T  RAKE  filter  v0.  However,  neither  R  nor  v0  is  known  to  the  receiver.  In  this  section,  we  present  the 
auxiliary- vector  (AV)  algorithm  which  provides  us  with  estimates  of  the  optimum  MMSE/MVDR  filter  when 
R  is  unknown  and  sample-average  estimated  from  the  size  M  packet  data  record  xq,  xi,  . . . ,  xm- r 


R(M) 


1 

M 


\i  r 

E 

i= 0 


(2.14) 


Throughout  this  section,  vo  is  assumed  to  be  known.  A  procedure  for  the  estimation  of  vo  from  the  same 
packet  data  record  x0,  x1; . . . ,  xM_j  will  be  presented  in  Section  2.4. 

The  AV  algorithm  generates  an  infinite  sequence  of  filters  {w„})/L0  •  The  sequence  is  initialized  at  the 
S-T  RAKE  filter 

W°  =  II  V°||2  ’  (2‘15) 

llvoll 

which  is  here  scaled  to  satisfy  w^vo  =  1.  At  each  step  A:  +  1  of  the  algorithm,  k  =  0, 1,  2, . . . ,  we  incorporate 
in  Wfc  an  “auxiliary”  vector  component  gt+i  that  is  orthogonal  to  v0  and  weighted  by  a  scalar  pk+i  and  we 
form  the  next  filter  in  the  sequence 

Wfc+,1  =  w k  -  fik+i gk+i-  (2-16) 

The  auxiliary  vector  gt+i  is  chosen  to  maximize,  under  fixed  norm,  the  magnitude  of  the  cross-correlation 
between  its  output  gj^x  and  the  previous  filter  output  w^x  and  is  given  by 

H  T>  ___ 

gfc+1  =  Rwfc  -  -jj^v0.  (2.17) 

The  scalar  fik+i  is  selected  such  that  it  minimizes  the  output  variance  of  the  filter  w/j+i  or  equivalently 
minimizes  the  MS  error  between  w^x  and  /4+iSM-iX-  The  MS-optimum  pu+i  is 


Pk+ 1  = 


gf+1Rwfc 

gf+1Rgfc+i ' 


(2.18) 


The  AV  filter  recursion  is  completely  defined  by  (2. 15)- (2. 18).  Theoretical  analysis  of  the  AV  algorithm  was 
pursued  in  [33].  The  results  are  summarized  below  in  the  form  of  a  theorem. 


Theorem  2.1  Let  R  be  a  Her mitian  positive  definite  matrix.  Consider  the  iterative  algorithm  of  eqs.  (2.15)- 
(2.18). 

(i)  Successive  auxiliary  vectors  generated  through  (2.16)-(2.18)  are  orthogonal:  g^g =0,  i  =  1,2,3,... 
(however,  in  general  gfgj  yf  0  for  \i  —  j\  ^  1). 

(ii)  The  generated  sequence  of  auxiliary -vector  weights  n  =  1,2,...,  is  real-valued,  positive,  and 

bounded:  0  <  t-T —  <  ,t  <  _!_  n  —  12,...,  where  Amax  and  Amin  are  the  maximum  and  minimum, 
correspondingly,  eigenvalues  of  R. 

(iii)  The  sequence  of  auxiliary  vectors  {gra};  n=  1,2,...,  converges  to  the  0  vector:  lim  g„  =  0.  (iv)  The 

n—too 

sequence  of  auxiliary-vector  filters  {w„}  ,  n  =  1,  2, ... ,  converges  to  the  MVDR  filter:  lim  w„  =  ]/.,  _^°  . 

n— >oo  vo  ■K'  v° 
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If  R  is  unknown  and  sample- average  estimated  from  a  packet  data  record  of  M  points  as  in  (2.14),  then 
Theorem  2.1  shows  that 

[r(M)1  v0 

W n{M)  — >  Woo{M)  =  — ! —  (2.19) 

V?  [R(M)]  v„ 

where  Wm  (M)  is  the  widely  used  MVDR  filter  estimator  known  as  the  sample-matrix-inversion  (SMI)  filter 
[27].  The  output  sequence  begins  from  w0(M)  =  which  is  a  0- variance,  fixed- valued,  estimator  that 

may  be  severely  biased  (wo  (M)  =  pppp  7^  wMvdr)  unless  R  =  a2 1  for  some  a  >  0.  In  the  latter  trivial  case, 
wo(Af)  is  already  the  perfect  MVDR  filter.  Otherwise,  the  next  filter  estimator  in  the  sequence,  wi(M),  has 
significantly  reduced  bias  due  to  the  optimization  procedure  employed  at  the  expense  of  non-zero  estimator 
(co-)variance.  As  we  move  up  in  the  sequence  of  filter  estimators  w n(M),  n  =  0, 1,  2, . . .,  the  bias  decreases 
rapidly  to  zero  while  the  variance  rises  slowly  to  the  SMI  (w^M))  levels  (cf.  (2.19)). 

An  adaptive  data-dependent  procedure  for  the  selection  of  the  most  appropriate  member  of  the  AV  filter 
estimator  sequence  {w n(M)}  for  a  given  data  record  of  size  M  is  presented  in  [34],  [35].  The  procedure 
selects  the  estimator  w„0  from  the  generated  sequence  of  AV  filter  estimators  that  exhibits  maximum  J- 
divergence  between  the  filter  output  conditional  distributions  given  that  +1  or  —1  is  transmitted.  Under 
a  Gaussian  approximation  on  the  conditional  filter  output  distribution,  it  was  shown  in  [34],  [35]  that  the 
J-divergence  of  the  filter  estimator  with  n  auxiliary  vectors  is 


4E2{b0Re  w^(M)x  } 
V ar  {6oRe  [w^(M)x]} 


(2.20) 


To  estimate  the  J-divergence  J(n)  from  the  data  packet  of  size  M,  the  transmitted  information  bits  bo 
are  required  to  be  known.  We  can  obtain  a  blind  approximate  version  of  J ( n )  by  substituting  the  information 
bit  bo  in  (2.20)  by  the  detected  bit  bo  =  sgn  (Re  {w^(M)x})  (output  of  the  sign  detector  that  follows  the 
linear  filter).  In  particular,  using  bo  in  place  of  bo  in  (2.20)  we  obtain  the  following  J-divergence  expression: 


4£2{&0Re  [w"(M)x] ) 

Mn)  =  - L - - - i 

V ar  l  6oRe  [w^(M)x]  l 


4  E2{  Re  w^(M)x  } 
Var  {|Re  [w^(M)x]  | } 


(2.21) 


where  the  subscript  “B”  identifies  the  blind  version  of  the  J-divergence  function.  To  estimate  Js(n)  from  the 
data  packet  of  size  M,  we  substitute  the  statistical  expectations  in  (2.21)  by  sample  averages.  The  following 
criterion  summarizes  the  corresponding  AV  filter  estimator  selection  rule. 


Criterion  2.1  For  a  given  data  record  of  size  M ,  the  unsupervised  (blind)  J-divergence  AV  filter  estimator 
selection  rule  chooses  the  estimator  w„0  (M)  with  no  auxiliary  vectors  where 


n0=arg  max- 


:|./B(n)  j 


=arg  max< 


ffE,=ol  Re  [w"(M)xi]| 

1 

[w  1  lRe  [™n  (M)xi]|2- 

^E,=o1|Re[w^(M)x,]| 

7 

(2.22) 


Criterion  2.1  completes  the  design  of  the  joint  S-T  auxiliary- vector  filter  estimator.  As  a  brief  summary, 
the  sequence  of  calculations  is  as  follows.  The  whole  packet  data  record  of  joint  S-T  input  data  vectors 
xo,  xi, . . . ,  xm_j  is  utilized  to  obtain  the  estimate  R (M)  of  R  according  to  (2.14).  Then,  the  sequence  of 
AV  filter  estimators  wo(M),  w1(M),  W2(M), . . .  is  generated  by  the  recursive  algorithm  of  (2.15)-(2.18)  with 
the  true  autocorrelation  matrix  R  replaced  by  its  estimate  R(M).  The  most  appropriate  AV  filter  estimator 
w n0{M)  in  the  sequence  is  selected  according  to  the  “maximum  estimated  blind  J-divergence  rule”  of  (2.22). 
Finally,  the  AV  filter  estimator  w„0  (M)  is  used  for  bit-by-bit  detection  of  the  packet  information  bit  sequence 
bo(0),  b o(l),  •  •  • ,  bo(M  —  1)  according  to  (2.10). 

For  simplicity  in  the  presentation  of  the  auxiliary- vector  filter  estimation  algorithm  in  this  section,  the 
S-T  RAKE  filter  of  the  user  of  interest  vo  was  assumed  to  be  known  which  is  certainly  not  true  in  practice. 
Section  2.4  is  devoted  to  the  estimation  of  v0  from  the  same  packet  data  record  x0,xls . . . ,  xjf_i  which,  we 
recall,  includes  Mv  pilot  bits. 
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2.4  S-T  RAKE  Filter  Estimation 


The  S-T  RAKE  filter  of  the  user  of  interest  (user  0)  vo  defined  by  (2.11)  consists  of  shifted  versions  of  the 
S-T  matched  filter  multiplied  by  the  corresponding  channel  coefficients:2 


vo 


N-l 
^  C0,n 
n= 0 


n 


T 


TV— n— 1 


(2.23) 


where  the  symbol  ©  denotes  the  Kronecker  tensor  product.  Hence,  vo  is  a  function  of  the  binary  signature 
vector  (spreading  sequence)  of  the  user  of  interest  do,  the  channel  coefficients  co,o,co,i,  ■  •  •  ;co,iV-i,  and  the 
corresponding  angles  of  arrival  #o,cn  #o,i>  •  •  • ,  $o,iV-i-  While  the  spreading  sequence  is  assumed  to  be  known 
to  the  receiver,  the  channel  coefficients  and  the  angles  of  arrival  are  in  general  unknown. 

Channel  estimation  with  one  or  multiple  sensors  is  a  subject  that  has  been  researched  extensively  over 
the  past  several  years.  In  [42],  the  channel  coefficients  of  a  DS-CDMA  link  are  estimated  in  the  absence  of 
multipaths  based  on  the  cyclostationarity  features  of  the  received  signals  (see  also  [43]  where  the  receiver  is 
equipped  with  multiple  sensors  and  maximum-likelihood  (ML)  channel  estimation  is  attempted  by  modeling 
the  interference  as  colored  Gaussian  noise).  In  [44],  ML  estimation  of  multipath  channels  is  pursued  again 
using  multiple  sensors  at  the  receiver.  Both  ML  estimation  methods  of  [43]  and  [44] ,  however,  are  supervised 
(they  require  knowledge  of  the  transmitted  information  bits).  A  subspace  method  for  direct  estimation  of 
the  MMSE  filter  (without  intermediate  channel  estimation)  is  presented  in  [45].  Identifiability  results  on 
subspace-based  channel  estimation  are  derived  in  [46].  In  [47],  a  constrained  adaptive  algorithm  is  presented 
that  combines  all  multipath  components  and  minimizes  the  multiuser  interference  at  the  receiver  output. 
However,  this  algorithm  (as  well  as  the  methods  in  [43]  and  [45])  has  satisfactory  performance  only  when 
large  data  records  are  utilized  for  the  training  of  the  estimator.  Based  on  a  genetic  algorithm,  an  iterative 
method  for  joint  channel  estimation  and  symbol  detection  is  presented  in  [48].  The  reduced-rank  multistage 
Wiener  filter  of  [49]  is  used  for  channel  estimation  purposes  in  [50].  It  has  been  shown,  however,  that  the 
AV  filter  estimators  presented  in  the  previous  section  outperform  significantly  in  MS  filter  estimation  error 
the  reduced-rank  multistage  Wiener  filters  in  short-data- record  situations  [33].  Finally,  least-squares-based 
supervised  channel  estimation  for  long-coded  CDMA  systems  (which  is  not  covered  in  the  present  work)  is 
performed  in  [51]. 

In  this  section,  we  are  interested  in  carrying  out  blind  (non-supervised)  multipath  S-T  channel  estimation 
when  only  a  short  input  data  record  of  size  M  is  available  for  the  preparation  of  the  estimate.  To  estimate  the 
channel  coefficients  Co  =  [co,o,  cop, . . . ,  co,jv-i]  and  the  angles  of  arrival  0 o  =  [#o,0;  #o,i;  •  •  • ,  #o,jv-i]  for  the 
user  of  interest  0  from  the  S-T  packet  data  record  xo,  xi, . . . ,  xm_ i,  we  design  a  subspace-based  estimation 
procedure.  This  procedure  can  be  viewed  as  a  generalization  of  the  algorithm  in  [36]  for  an  antenna-array 
set-up.  An  additional  modification  is  introduced  that  increases  the  rank  of  the  noise  subspace. 

The  rank  rs  of  the  signal  subspace  of  the  received  data  vectors  x  can  be  controlled  by  considering  one¬ 
sided  or  two-sided  truncations  of  x  (the  latter  eliminates  inter-symbol- interference  (ISI) ) .  The  possible  values 
of  rs,  depending  on  the  data  formation  of  choice,  are  as  follows. 

(i)  No  truncation:  Data  dimension  =  Z(L  +  N  —  1),  2 K  +  1  <  rs  <  3 K. 

(ii)  One-sided  truncation:  Data  dimension  =  ZL,  2 K  <rs<  3 K  —  1. 

(in)  Two-sided  truncation:  Data  dimension  =  Z(L  —  N  +  1),  K  <  rs  <  2 K  —  1. 

To  have  a  guaranteed  minimum  rank  of  the  noise  subspace  of  Z(L  —  N  + 1)  —  (2K  —  1),  we  decide  to  truncate 
x  from  both  sides  (Case  (iii) )  as  shown  in  Fig.  2.2  and  we  form  the  “truncated”  received  vector  xtr  of  length 
Z(L  —  N  +  1)  as  follows: 

■  r((IV-l)Tc) 

tr  <ntc) 

x  = 

.  r((L  -  1  )TC) 


2For  the  sake  of  mathematical  accuracy,  v0  =  y-j(SL  ^2^=0  c°>n 
definition  in  (11).  The  positive  multiplier  is  dropped  from  (23)  as  inconsequential. 


0...0  d()  0...0 


(2.24) 


©  a0,n  according  to  the 
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Then,  with  respect  to  the  ith.  information  bit  of  user  0 ,  xjr  can  be  expressed  as 

xf  =  &o(*)^A0B(6>o)co  +  MAI;  +  n;  (2.25) 

Ju 

where  MAI;  accounts  comprehensively  for  S-T  multiple-access-interference  of  rank  rs  —  1,  B(0O)  is  a  block 
diagonal  matrix  of  the  form  B(#o)  =  diag  (ao,o,  aoq, . . . ,  ao,jv-i) ,  and  Ao  =  Ag  ©  I z  where  Iz  is  the  Z  x  Z 
identity  matrix  and 

'd0[N-l]  d0[N-2]  ...  do  [0] 

d0[JV]  do[N-l]  ...  d0[l] 

Ao=  :  : 

_d0[L-l]  do[L  —  2]  ...  d0[L-N] 

Let  Rtr  =  E  |xtrxtrff  |  be  the  autocorrelation  matrix  of  xtr.  We  form  a  sample-average  estimate 

M  —  l 

i  =  *y*y"  (2-27) 

i=  0 

based  on  the  truncated  M  available  S-T  input  vectors  xjr,  i  =  0, 1, . . . , ,.M  —  1.  If  Rtr  =  QAQH  represents 
the  eigen-decomposition  of  Rtr  where  the  columns  of  Q  are  the  eigenvectors  of  Rtr  and  A  is  a  diagonal 
matrix  consisting  of  the  eigenvalues  of  Rtr,  then  we  use  the  eigenvectors  that  correspond  to  the  Z(L  — 
N  +  1)  —  (2 K  —  1)  smallest  eigenvalues  to  define  our  estimated  noise  subspace.  Let  the  matrix  U„  of  size 
[Z(L  —  N  +  1)]  x  [Z{L  —  N  +  1)  —  (2K  —  1)]  consist  of  these  “noise  eigenvectors.”3  We  propose  to  estimate 
c0  and  O0  indirectly  through  an  estimate  of  the  ZN  x  1  vector 

h0  =  B(0o)co.  (2.28) 

We  estimate  ho  as  the  vector  that  minimizes  the  norm  of  the  projection  of  the  signal  of  the  user  of  interest 
0,  A0B(6»0)  Co  =  Aoho,  onto  the  estimated  noise  subspace  U„  [36],  [52]: 

h0  =  argmin  (A0h0)ffUn  subject  to  h0  =  1.  (2.29) 

h0 

The  solution  to  this  constrained  minimization  problem  is  given  by  the  following  proposition.  The  proof  is 
rather  straightforward  and,  therefore,  omitted. 

Proposition  2.1  The  vector  ho  that  minimizes  the  norm  of  the  projection  of  the  signal  of  user  0  to  the 
estimated  noise  subspace  Un  subject  to  the  norm  constraint  ||ho|]  =  1  is  the  eigenvector  that  corresponds  to 
the  minimum  eigenvalue  of  A^U„U^Ao.  □ 

After  obtaining  ho,  we  may  extract  the  desired  vectors  Co  and  Oo  by  applying  least-squares  (LS)  fitting  to 
ho-  Then,  the  S-T  RAKE  filter  estimate  vo  is  completely  defined  by  (2.23). 

Since  the  S-T  channel  estimation  method  described  above  in  Proposition  2.1  is  based  on  a  blind  second- 
order  criterion,  the  phase  information  is  absorbed  and  the  estimate  vo  is  phase  ambiguous.  Inherently, 
adaptive  filter  estimators  that  utilize  a  phase  ambiguous  RAKE  filter  estimate  are  also  phase  ambiguous.  In 
the  rest  of  this  section,  we  deal  with  the  recovery  (correction)  of  the  phase  of  S-T  linear  filters  when  the  S-T 
RAKE  vector  vo  is  known  within  a  phase  ambiguity. 

Without  loss  of  generality,  let  vq  denote  a  phase  ambiguous  version  of  vq,  i.e. 


(2.26) 


voe-7^  =  v0 


(2.30) 


3  The  definition  of  U„  requires  knowledge  of  the  number  of  active  users  K.  If  K  is  not  known,  it  can  be  either 
estimated  adaptively  from  the  eigenvalue  profile  of  Rtr  or  set  to  the  maximum  possible  value  K  =  Amax  allowed  by 
the  network  protocol  for  capacity  controlled  systems.  The  latter  conservative  option  K  =  Amax  is  quite  appealing 
due  to  its  simplicity  and  effectiveness;  system  performance  is  not  affected  significantly  since  in  practical  applications 
the  S-T  product  Z  (L  —  N  +  1)  is  much  larger  than  2A'max  —  1. 
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where  ip  is  the  unknown  phase.  We  consider  the  class  of  linear  filters  w  e  Cz< L+N  1)  are  functions  of 
the  S-T  RAKE  vector  vo  and  share  the  following  property: 

w(v0)  =  w(-v0)eJ,p.  (2.31) 

Such  filters  include:  (i)  the  S-T  RAKE  filter  itself  v0,  (ii)  the  S-T  MMSE/MVDR  filter  of  (2.12)  or  (2.13), 
and  (Hi)  the  auxiliary-vector  sequence  of  S-T  filters  {w„}  defined  by  eqs.  (2.15)-(2.18).  Given  vo,  we  attempt 
to  correct  the  phase  of  w(vo)  directly.  The  selection  criterion  for  the  phase  correction  parameter  ip  that  we 
propose  in  this  work  is  the  minimization  of  the  mean-square-error  (MSE)  between  the  phase  corrected  S-T 
filter  processed  data  [w(vo)e-7^]  x  and  the  desired  information  bit  bo  (Fig.  2.3): 


arg  min  E 

ip 


bo 


Ip  £  [— 7T,7r). 


(2.32) 


The  following  proposition  identifies  the  optimum  phase  correction  value  according  to  our  criterion. 


Proposition  2.2  The  phase  correction  value 

ip  =  angle  {w(v0)HE  {x60}}  (2.33) 

minimizes  the  MSE  between  the  phase-coiTected  S-T  filter  processed  data  [w(v0)e-7^] H  x  and  the  desired 
information  bit  bo  ■  □ 

In  essence,  Proposition  2.2  suggests  to  project  the  phase  ambiguous  filter  w(vo)  onto  the  ideal  S-T 
RAKE  filter  vo  =  E{xbo}.  However,  E{xbo}  is  certainly  not  known.  Since  we  have  assumed  that  a  pilot 

information  bit  sequence  of  length  Mp  is  included  in  each  packet,  the  expectation  E{xbo}  can  be  sample- 

average  estimated  by  xi^o(i)  where  bo(j),  j  =  1,2,...,  Mp,  is  the  jth  pilot  information  bit  and  x^ 

is  the  corresponding  S-T  input  data  vector.  Then,  the  phase-corrected  adaptive  filter  estimate  is  given  by 

w(v0,R)e^,  ip  =  angle  /  w(v0,  R)ff 


Mp 

E 

3  =  1 


xjMi) 


(2.34) 


We  note  that  if  the  autocorrelation  matrix  R  were  known  exactly,  the  phase  correction  estimate  for  the  filter 
w(vo,R)  =  would  become  ip  =  angle  jv^R-1  JEE  xj^o(i)j  |  which  is  also  the  ML  estimate 

of  ip  when  a  data  record  {(xi,&o(l))>  (x2,&o(2)),  •••,  (xm,, bo(Mp))}  is  available  and  the  interference- 
plus-noise  contribution  to  the  received  vector  x^  is  modeled  as  a  colored  Gaussian  vector.  This  result  -and 
phase  correction  proposal-  is  in  sharp  contrast  to  previous  work,  [39],  [53],  that  ignored  the  interference  or, 
equivalently,  treated  the  interference-plus-noise  contribution  as  a  white  Gaussian  vector  leading  to  the  ML 
estimate  ip  =  angle  jv^  [Ejffi  Xj&oC?)]  j  . 

Since  M  represents  the  packet  size  of  the  DS-CDMA  system  and  Mp  is  the  number  of  mid-amble  pilot 
information  bits  per  packet,  the  ratio  quantifies  bandwidth  waste  due  to  the  use  of  the  pilot  bit  sequence. 
Ideally,  is  to  be  kept  small.  As  we  will  see  in  Section  2.5,  a  few  pilot  bits  (of  the  order  of  5  bits)  are 
sufficient  for  effective  recovery  of  the  filter  phase.  As  a  numerical  example,  when  the  packet  size  is  set  at 
M  =  256  and  Mp  =  5  is  chosen,  the  wasted  bandwidth  is  only  ~  2%. 


2.5  Capacity,  Throughput,  and  Delay  Analysis 

So  far,  we  concentrated  on  the  design/estimation  of  the  receiver  filter  w  in  (2.10).  Our  filter  estimate  is 
based  on  the  AV  algorithm  combined  with  subspace-based  techniques  and  supervised  phase  correction  for 
S-T  channel  estimation  purposes.  The  proposed  overall  AV  filter  estimate  is  generated  adaptively  on  an 
individual  packet-by-packet  basis.  All  M  received  vectors  of  the  packet  are  utilized  for  the  design  of  w 
(estimation  of  R,  vo,  and  the  number  of  AV’s  no),  while  Mp  of  them  are  also  used  for  supervised  phase 
correction  (estimation  of  ip).  Then,  the  M  —  Mp  information  bits  of  user  0  associated  with  the  remaining 
M  —  Mp  received  vectors  are  detected  by  (2.10). 
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The  effectiveness  of  the  filter  is  characterized  statistically  by  the  probability  distribution  of  the  number 
of  bit  errors  in  a  packet: 


p(i)  =  Pr  {*  bit  errors  in  the  packet}  ,  i  =  0, 1, . . . ,  M  —  Mp.  (2.35) 

W.l.o.g.  we  define  the  bit-error-rate  (BER)  as  the  probability  of  erroneous  detection  of  &o(0)  (the  first  bit 
of  user  0  in  the  packet): 


BER  =  Pr 


|So(0)  7^  fro(0)  j  =  Pr  j&o(0)  7^  &o(0)  i  bit  errors  in  the  packet  jp(i) 


i= 0 

M  —  Mp  .  M  —  Mp 

P(0  =  17  1  n,r  E  **>(*)■ 


(2.36) 


=  E 


M  -  Mv 

i= 0  p 


M  -M, 


p 


i=0 


It  is  interesting  to  note  at  this  point  that  if  the  filter  w  were  independent  of  the  information  bit  stream  6o(0), 
&o(l)i  *  ■  ■  j  then  the  BER  could  have  been  expressed  analytically  as  a  weighted  sum  of  the  value  of  the  error 
function  Q(x)  =  f^e*  / 2dt  at  2A_1  different  points  [39].  However,  this  independence  assumption  does 
not  hold  true  in  our  system  since  the  packet  data  record  xo,  xi,  . . . ,  xm_  i  (that  includes  the  information 
bits  6o(0),  &o(l);  •  •  • ,  bo(M  —  1)  to  be  detected)  is  directly  utilized  for  the  calculation  of  w.  Therefore,  use 
of  the  analytical  BER  expression  is  not  appropriate  in  our  performance  evaluation  studies  for  which  we  can 
only  rely  on  (2.36). 

A  data  packet  is  received  successfully  within  a  single  time  slot  if  the  number  of  errors  in  the  detection 
of  the  M  —  Mp  information  bits  is  less  than  or  equal  to  the  maximum  number  of  (correctable)  bit  errors 
allowed  by  the  forward-error-correction  (FEC)  module.  If  no  FEC  is  present,  then  the  packet  is  successfully 
received  when  all  M  —  Mp  information  bits  are  detected  correctly.  The  packet-error-rate  (PER)  is  defined 
as  the  probability  of  receiving  an  uncorrectable  packet  within  a  single  time  slot  and  is  given  by 

M—Mp  h 

PER(ft)  =  Pr  {more  than  h  bit  errors  in  the  packet}  =  E  p(i)  =  i-E*o  (2-37) 

i  h  ■  \  i=  0 

where  h  is  the  maximum  number  of  correctable  bit  errors  per  packet.  By  setting  h  =  0  we  obtain  the  PER 
of  a  system  without  FEC:  PER(O)  =  i  Mp  p(i)  =  1  —  p(0). 

To  examine  the  PER  performance  of  the  DS-CDMA  system  described  in  Section  2.2  equipped  with  the 
proposed  packet-rate  adaptive  S-T  AV  filter  receiver,  we  proceed  with  an  illustration.  We  consider  dedicated 
Gold  user  signature  assignments  of  length  L  =  31.  We  fix  the  packet  size  at  M  =  256  bits  and  we  use 
Mp  =  5  of  them  as  pilot  mid-amble  bits.  Each  user  signal  experiences  N  =  3  independent  Rayleigh  fading 
paths  with  equal  average  received  energy  per  path  and  independent  angles  of  arrival  uniformly  distributed 
in  (— f ,  § )  .  In  all  of  our  studies  we  consider  averages  over  20,000  independently  drawn  multipath  Rayleigh 
fading  S-T  channels.  The  receiver  antenna  array  consists  of  Z  =  4  elements.  With  these  numbers,  the 
multipath  extended  S-T  product  (or  length  of  the  adaptive  filter)  is  Z(L  +  N  —  1)  =  132.  For  simplicity  in 
our  studies,  we  assume  that  the  packets  of  all  users  arrive  at  the  same  average  total  power  level  (affected  by 
independent  fading  per  packet,  user,  and  path).  The  average  total  received  pre-detection  SNR  of  each  user 
(the  sum  of  the  average  received  SNR’s  over  all  paths)  is  set  to  lldB  where,  following  the  notation  in  (2.7), 
the  average  total  received  SNR  for  user  k,  k  =  0, 1, . . . ,  is  defined  by  2 ^  |lcfc,"|2|  /Ao  (we  recall 

that  A  is  the  AWGN  power  spectral  density  assumed  to  be  identical  for  every  spatial  channel/ antenna 
element). 

In  Fig.  2.4(a)  we  set  the  number  of  users  to  K  =  15  and  we  plot  p(i)  for  various  receivers:  (i)  S-T  RAKE, 
(ii)  SMI  [27],  (Hi)  Auxiliary- Vector,  and  for  reference  purposes  (iv)  the  reduced-rank  orthogonal  multistage 
Wiener  filter  in  [49]  (with  the  preferred  number  of  stages  D  =  8  [54]).  All  filter  estimates  utilize  exactly  the 
same  data  and  for  all  estimates  phase  recovery  with  Mp  =  5  pilot  bits  is  carried  out  according  to  Section  2.4. 
The  AV  and  S-T  RAKE  filters  are  seen  to  be  significantly  less  vulnerable  to  bit  errors  than  their  multistage 
and  SMI  counterparts.  We  also  observe  that  the  probability  mass  function  p(i)  is  practically  zero  for  the 
AV  filter  when  i  >  6.  In  Fig.  2.4(b),  we  concentrate  only  on  the  AV  filter  and  we  plot  p(i)  for  the  cases  of 
K  =  15  and  K  =  30  active  users.  For  K  =  30,  the  probability  mass  is  shifted  gently  to  the  right. 
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In  Fig.  2.5,  we  study  the  BER  performance  of  all  these  receivers  as  a  function  of  the  number  of  active 
users  K  in  the  system.  We  also  add  to  this  study  the  multistage  Wiener  filter  [49]  with  D  =  2  stages 
which  we  found  empirically  to  be  the  best  number  of  stages  for  this  specific  problem.  For  each  receiver 
and  every  value  of  K,  the  BER  is  evaluated  through  (2.36).  We  immediately  observe  the  disappointing 
performance  of  the  SMI  and  multistage  ( D  =  8)  Wiener  filters.  We  also  note  the  disturbing  behavior  of 
the  best  multistage  ( D  =  2)  Wiener  filter  when  K  <  4  due  to  the  implicit  matrix  inversion  implemented 
through  the  backward  recursion  of  the  multistage  algorithm  (for  K  <  4  the  corresponding  reduced-rank 
data  autocorrelation  matrix  estimate  becomes  ill-conditioned  and  its  implicit  inversion  in  the  multistage 
algorithm  leads  to  this  phenomenon).  The  superiority  of  the  AV  filtering  algorithm  over  all  other  schemes 
and  over  the  whole  range  of  the  number  of  users  K  is  apparent. 

In  Fig.  2.6(a),  we  plot  the  PER  as  a  function  of  the  number  of  active  users  K  using  (2.37).  No  FEC  is 
assumed  (h  =  0).  As  expected,  the  relative  PER  performance  results  parallel  the  BER  findings  in  Fig.  2.5. 
Joint  S-T  processing  of  the  antenna-array  received,  multipath  combined  data  has  “increased”  the  effective 
system  processing  gain  from  L  =  31  to  Z(L  +  N  —  1)  =  132;  indeed,  it  is  satisfying  to  observe  that  the  PER 
of  the  S-T  RAKE,  best  multistage  Wiener  filter  (D  =  2),  and  AV  receivers  approaches  1  for  user  numbers 
much  greater  than  31.  In  a  rather  more  interesting  study,  Fig.  2.6(b)  shows  the  PER  under  h  =  4  FEC. 

System  BER/PER  performance  improvements  due  to  the  use  of  the  AV  receiver  under  joint  S-T  data 
processing  allow  us  to  accommodate  more  users  for  a  set  PER  constraint  (or  reduce  the  transmitting  power 
of  the  hand-set  or  increase  the  range/coverage  of  the  base-station  transceiver  for  a  pre-set  maximum  number 
of  active  users).  We  may  express  the  PER  as  a  function  of  both  the  FEC  capabilities  h  and  the  number  of 
active  users  K,  PER(/i,  K).  Then,  we  define  the  user  capacity  C(h)  as  the  maximum  number  of  active  users 
(packets  in  a  slot)  under  which  a  given  PER  constraint  is  satisfied: 

C(h )  =  max  {K  :  PER (h,  K)  <  A}  (2.38) 

where  A  is  the  PER  constraint  threshold.  The  choice  of  the  threshold  value  A  depends  on  the  specific  user 
application  requirements  [9],  [10].  For  example,  real-time  voice  data  cannot  benefit  -in  general-  from  packet 
retransmission  and  may  require  a  A  value  close  to  10-2.  Upholding  of  the  PER  performance  constraint 
requires  that  the  network  controls  the  number  of  active  users  in  the  system  K  and  enforces  K  <  C(h)  at  all 
times. 

We  return  to  the  previous  illustration  to  examine  the  user  capacity  of  the  system  for  the  S-T  adaptive 
receivers  under  consideration.  We  set,  for  example,  the  PER  constraint  threshold  to  A  =  10-2.  From 
Fig.  2.6(a)  we  conclude  that  in  the  absence  of  FEC  the  user  capacity  of  the  AV  system  is  10  and  the  capacity 
of  the  S-T  RAKE  system  is  2.  Significant  capacity  improvement  is  achieved  with  4-bit  FEC  (Fig.  2.6(b)). 
The  AV  system  supports  23  users  and  the  S-T  RAKE  6.  Neither  the  D  =  8  multistage  Wiener  filter  nor 
the  SMI  receiver  can  meet  the  PER  constraint  for  any  number  of  users  K  >  1,  with  or  without  FEC.  The 
best  multistage  Wiener  filter  ( D  =  2)  receiver  can  support  4  to  6  users  when  no  FEC  is  considered  and  3 
to  14  users  with  4-bit  FEC.  However,  it  cannot  meet  the  PER  requirement  when  K  <  2  and  it  has  to  be 
withdrawn  as  an  option  for  the  packet-rate  adaptive  S-T  system  under  consideration.  We  conclude  that  we 
have  two  viable  solutions:  the  AV  and  plain  S-T  RAKE  receiver  systems.  The  capacity  gain  of  the  proposed 
AV  system  is  certainly  clear.  The  AV  system  allows  up  to  23/31~74%  loading  for  this  Gold  coded  system 
with  signature  length  31  and  4-bit  FEC  (with  Z  =  4  antenna  elements  and  multipath  fading  reception  with 
lldB  average  total  pre-detection  SNR  per  user). 

Let  us  now  assume  that  if  a  data  packet  is  received  with  only  correctable  errors  a  positive  ACK  is  sent 
back  to  the  user  over  a  different  downlink  channel  (FDD).  Once  an  uncorrectable  error  is  detected  in  the 
packet,  a  NAK  is  sent  back  to  the  mobile  which  then  retransmits  after  waiting  for  a  random  number  of  time 
slots.  We  denote  the  probability  of  having  K  packets  transmitted  during  a  time  slot  interval  Ts  by  A(K). 
A(K)  depends  on  the  traffic  model.  For  example,  if  potential  users  (packet  contributors)  come  from  an 
infinite  population  and  the  number  of  users  that  attempt  to  transmit  a  packet  during  a  time  slot  is  Poisson 
distributed  with  mean  G  (offered  packet  traffic  load)  [10],  [11],  then  A(K)  =  — ,  K  =  0,1,2,.... 

If,  on  the  other  hand,  potential  users  come  from  a  finite  population  of  size  N  and  each  user  attempts 
independently  to  transmit  a  packet  with  probability  ^  ( G  represents  again  the  offered  traffic  load  per  slot), 

then  A(K)  =  (^)  (^-)A  (l  —  K  ,  0  <  I\  <  N,  [9],  [10],  [12],  [13].  Of  course,  for  large  values  of  N  the 
hard-to-evaluate  binomial  distribution  approaches  quickly  the  Poisson  distribution.  In  either  case,  when  the 
network  system  enforces  a  user  capacity  cap  K  <  C(h)  the  distribution  of  the  number  of  packets  in  a  slot 
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A(K)  remains  unchanged  for  K  =  0, 1, . . .  ,C(h)  —  1,  while  A(I\  =  C(h))  =  1  —  Ylk=o  1  4(fc)  and  4(.ff)  =  0 
for  K  >  C(h). 

Packet  throughput  is  defined  as  the  average  number  of  successful  packet  transmissions  per  time  slot. 
Let  PSR(/i,  K)  be  the  physical  layer  one-shot  packet  success  rate  (that  is  the  probability  that  a  packet  is 
received  successfully  within  a  single  time  slot)  with  h- bit  FEC  in  the  presence  of  K  users.  Then,  PSR (h,  K)  = 
1  —  PER(/i,  K)  and  the  packet  throughput  of  the  system  S  can  be  expressed  as 

oo  oo 

S(h)  =  J2  K  A(K)  PSR (h,  K)  =  K  A[K)  (1  -  PER (h,  K)) .  (2.39) 

K= 1  K= 1 


In  Fig.  2.7,  we  plot  the  packet  throughput  S(h)  as  a  function  of  the  offered  packet  traffic  load  G  for 
binomial  packet  arrivals  (TV  =  33)  without  user  capacity  control.  The  cases  of  no  FEC  ( h  =  0)  and  4-bit 
FEC  ( h  =  4)  are  examined  separately  in  Part  (a)  and  (b),  respectively.  We  see  that  at  the  maximum  traffic 
load  of  33  packets,  ^|4il00%~  77.6%  of  the  packets  in  the  AV  system  go  through  successfully  on  the  average 
when  h  =  0.  This  number  increases  to  ^|^100%~  96.7%  when  h  =  4.  In  Fig.  2.8  we  repeat  the  exact 
same  study  of  Fig.  2.7  for  a  network  system  that  enforces  user  capacity  control  (number  of  packets  K  in 
each  system  slot  is  less  than  or  equal  to  C(h)  to  guarantee  PER(/i,  K)  quality  A  =  10-2  or  better  at  all 
times).  As  expected  (cf.  (2.39)),  the  packet  throughput  curves  converge  quickly  to  C (h)PSR(h,  C (h))  as  G 
increases.  When  h  =  0,  the  limit  throughput  value  is  9.9  packets  per  slot  for  the  AV  system  and  2.0  packets 
per  slot  for  the  S-T  RAKE  system.  When  h  =  4,  the  limit  throughput  values  become  22.8  and  5.9  packets 
per  slot,  correspondingly.  Fig.  2.8  quantifies  the  potential  contribution  of  AV  receivers  in  networks  with  user 
capacity  control.  The  study  of  the  important  trade-off  between  the  PER  quality  constraint  A  and  the  limit 
throughput  C(h)PSR(h,  C(h))  is  beyond  the  scope  of  this  work. 

As  a  final  study,  we  consider  the  network  delay  metric  D (h)  defined  as  the  average  number  of  slots 
from  the  time  a  packet  is  presented  to  the  network  until  it  is  successfully  received.  Markov-chain  analysis 
techniques  lead  us  to  the  following  expression  for  the  network  delay  of  a  system  with  user  capacity  cap  C(h).4 
The  proof  is  given  in  the  Appendix. 


D(h)  =  S0 


Si  [  ^  A(I<)PER(h,  I<)  +  f:A(K)°g>PER(h,C(h)) )  +  52^A(I<)  (l  - 

\if=l  K=C(h)  + 1  )  K=C(h)+ 1  V  2 

1  _  Cjr  A(I\)PER(h,  I\)  -  f:A(K)^PER(h,C(h))  -  £A(K)  (l  - 

K= 1  K=C(h)+ 1  K=C(h)  + 1  V  2 


(2.40) 


where  So,  Si,  and  82  denote,  correspondingly,  the  mean  delay  in  slot  units  for  the  execution  of  (i)  a  single 
transmission  with  successful  reception,  (ii)  a  retransmission  due  to  bit  errors,  and  (in)  a  retransmission  due 
to  rejection  by  the  capacity  controller  (when  more  than  C(h)  packets  are  presented  for  potential  transmission 
during  a  slot).  It  is  easy  to  verify  that  if  we  let  C(h)  — »  00  in  our  delay  expression  in  (2.40)  we  obtain  the 
familiar  delay  expression  for  links  without  user  capacity  control  [9],  [10],  [55]: 

OO 

Ji  £  A(K)¥YR(h,K) 

K= 1 

00 

1  -  E  A(K)PER(h,  K ) 

K= 1 


5l^E{PSR(h,K)}  1 


(2.41) 


Fig.  2.9  presents  the  network  delay  D(h)  as  a  function  of  the  throughput  S(h)  for  a  system  without  user 
capacity  control  and  binomial  (N  =  33)  arrivals.  The  mean  retransmission  delay  is  set  to  di  =  3  slots  and 
the  mean  successful  transmission  delay  is  do  =  1.5  slots.5  No  FEC  ( h  =  0)  is  examined  in  Fig.  2.9(a)  and 
4-bit  FEC  is  examined  in  Fig.  2.9(b).  As  desired  for  packetized  voice  applications,  the  network  delay  plots 
indicate  that  the  system  maintains  a  relatively  constant  delay  as  the  throughput  approaches  its  maximum 

4This  delay  analysis  assumes  a  simple  stop-and-wait  ARQ  protocol.  Certainly,  for  large  loads  the  transmission 
delay  may  be  reduced  by  means  of  a  more  efficient  selective  retransmission  protocol  which  enables  continuous  trans¬ 
mission. 

6It  is  assumed  that  the  minimum  delay  for  slotted  systems  is  1.5  slots  [10]  which  includes  the  average  time  interval 
between  the  presentation  of  a  packet  to  the  transmitter  and  the  beginning  of  the  next  slot. 
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value  (at  G  =  N  =  33).  In  Fig.  2.10,  we  repeat  the  study  of  Fig.  2.9  for  a  system  that  enforces  capacity 
control  with  mean  “rejection  delay”  S 2  =  1  slot.  The  plots  capture  again  the  possible  throughput-delay 
operating  points  and  help  us  visualize  how  the  delay  D(h)  in  (2.40)  increases  when  the  capacity-controlled 
throughput  S(h)  converges  to  C(h)PSR(h,  C(h))  for  high  traffic  loads  G. 

2.6  Summary  and  Conclusions 

We  considered  a  random-access  wireless  DS-CDMA  multipath  fading  AWGN  link  with  dedicated  signature  as¬ 
signments.  We  suggested  packet  length  design  that  is  sufficiently  short  to  conform  with  the  coherence  time  of 
the  link  (quasi-static  fading  over  the  duration  of  a  packet).  Then,  we  developed  a  complete  adaptive  antenna 
array  CDMA  linear  filter  receiver  design  that  adapts  itself  and  detects  the  transmitted  information  bits  on 
an  individual  packet-by-packet  basis.  Effective  short-data- record  (packet-rate)  filter  adaptation/estimation 
and  bit  detection  was  made  possible  due  to  the  proposed  auxiliary- vector  (AV)  filtering  algorithm.  The  lat¬ 
ter  incorporated  seamlessly  packet-rate  blind  subspace-based  space-time  channel  estimation  and  supervised 
recovery  of  the  space-time  channel  phase  through  the  use  of  a  few  packet  mid-amble  pilot  bits.  Our  studies 
showed  that  very  limited  mid-amble  pilot  signaling  (of  the  order  of  5/256~2%)  can  be  sufficient  for  phase 
recovery  and  effective  adaptive  receiver  design.  Therefore,  differential  modulation  to  overcome  the  phase 
ambiguity  problem  is  not  a  necessity. 

Bit-error-rate,  packet-error-rate,  and  user  capacity  studies  and  comparisons  demonstrated  state-of-the- 
art  performance  of  the  proposed  linear  adaptive  AV  receiver  system.  Through  the  development  of  the 
probability  mass  distribution  of  the  bit  errors  in  a  packet  we  were  able  to  translate  these  findings  to  packet 
throughput  results  (with  or  without  user  capacity  control).  We  found  for  example  that  for  a  multipath 
Rayleigh  fading  AWGN  link,  lldB  average  total  received  pre-detection  SNR  per  user,  four  antenna  ele¬ 
ments,  and  4-bit  FEC  the  proposed  adaptive  AV  receiver  system  offers  22  gjj5,9 100%~  286%  improvement 
in  maximum  packet  throughput  with  user  capacity  control  over  the  corresponding  system  that  employs  con¬ 
ventional  space-time  RAKE  reception.  Network  delay  derivations  and  studies  showed  a  slow,  well-behaved 
increase  of  the  AV  system  delay  for  throughput  up  to  its  maximum  value.  In  this  context,  the  packet-rate 
adaptive  receiver  design  developed  using  the  AV  filtering  principles  coupled  with  FEC  techniques  seems  to 
provide  an  effective  tool  to  improve  the  performance  of  random  access  DS-CDMA  mobile  communication 
links. 


2.7  Appendix 


We  denote  by  R\  and  R2  the  total  number  of  retransmissions  of  a  packet  until  successful  reception  due  to 
bit  errors  and  due  to  rejection  by  the  capacity  controller  (when  more  than  C(h)  packets  are  presented  for 
potential  transmission  during  a  slot),  respectively.  The  joint  probability  mass  function  (pmf)  of  Ri  and  R2 
is  given  by 

Pr{Ri  =  i,  R2  =  j}  =  (^+j^p\pJ2{l  -  pi  -p2),  i,j  =  0,1,2,...,  (2.42) 

where  pi  and  P2  denote  the  probability  that  a  retransmission  is  needed  due  to  bit  errors  and  due  to  rejection 
by  the  capacity  controller,  respectively: 


C(h) 


Pi 


=  Y  A(K)PER(h,  K)  +  Y  A(K)  — C{h)) 
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From  (2.42),  the  marginal  pmf  of  R\  is 


Pr  {Rx  =  1}  =  J^Pr  {Ri  =  i,  R2  =  j}  =  p\  (1  -  Pi  -  Pi)  Y 
3= 0  i= 0 
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Fig.  2.1.  Data  packet  structure  of  total  length  M  bits  that  contains  a  mid-amble  of  Mp  pilot  bits. 


For  any  pi  E  (0, 1),  p?2  =  we  differentiate  both  sides  of  this  equality  i  times,  i  =  0, 1, 2, . . .,  to 

obtain  J2Zo  =  n_E+i  or 
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From  (2.45)  and  (2.46)  we  conclude  that 


Pr{f?i  =i}  =  p\(l-Pl  -p2) 
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Therefore,  R\  is  geometrically  distributed  with  parameter  ^  E  (0, 1)  and  mean 

pi 


i  =  0,1,2,... 


e{ri}=TZ 


1—P2 


Pi 


1—P2 


1  -  Pi  -  P‘2 


Similarly,  following  the  same  derivation  steps  for  the  random  variable  R2  we  conclude  that 

E{R2}  =  - — p— 

1  -  Pi  -  P2 

Then,  the  network  delay  is  evaluated  as  follows: 


D(h)  =  S0  +  5\E  {iii}  +  S2E{R2}  =  S0  +  a,- - - +  52 - - ^ - =  <50  +  \lpl  +  §2p2 

1  —  Pl  —  P2  1  —  Pi  —  p2  1  —  Pi  —  P2 

which  coincides  with  (2.40)  (pi  and  p2  are  given  by  (2.43)  and  (2.44),  respectively). 


(2.46) 


(2.47) 


(2.48) 


(2.49) 


(2.50) 


43 


Xj  of  length  Z(L+N-1) 


b(i-l) 
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i  b(i) 

1 
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i  1 
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b(i-l) 

i 

b(i) 

1 
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1 

-  X  [r  of  length  Z(L-N+1)  — 

J ISI 

Fig.  2.2.  Data  collection  and  ISI  trimming. 


Fig.  2.3.  Phase  correction  for  the  S-T  linear  filter  w(vq). 
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Bit  error  distribution  p(i)  Bit  error  distribution  p(i) 


(b) 


Fig.  2.4.  Distribution  p(i)  of  number  of  bit  errors  per  packet  for  (a)  the  S-T  RAKE,  SMI,  multistage 
Wiener  filter  (D  =  8),  and  AV  receivers  in  the  presence  of  K  =  15  users  and  (b)  the  AV  receiver 
when  K  =  15  or  K  =  30. 


Fig.  2.5.  BER  versus  number  of  active  users  K  for  the  SMI,  multistage  Wiener  filter  with  the 
preferred  number  of  stages  (D  =  8),  S-T  RAKE,  best  multistage  Wiener  filter  ( D  =  2),  and  AV 
receivers. 
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Packet-  E  rror-Rate  Packet-  Error-  Rate 


(b) 


Fig.  2.6.  PER  versus  number  of  active  users  K  for  a  system  with  (a)  no  FEC  and  (b)  4-bit  FEC 


Offered  Traffic 


Fig.  2.7.  Packet  throughput  versus  offered  packet  traffic  load  without  capacity  control:  (a)  no  FEC, 
(b)  4-bit  FEC. 


Offered  Traffic 


Fig.  2.8.  Packet  throughput  versus  offered  traffic  with  capacity  control  ( K  <  C(h)):  (a)  no  FEC, 
(b)  4-bit  FEC. 


0  5  10  15  20  25  30 

Throughput 


Fig.  2.9.  Network  delay  (in  slots)  versus  throughput  for  a  system  without  capacity  control:  (a) 
FEC,  (b)  4-bit  FEC. 
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Chapter  3 


Fast  converging  minimum  probability 
of  error  neural  network  receivers  for 
DS-CDMA  communications 

3.1  Introduction 

Direct-sequence  code-division-multiple-access  =(DS-CDMA)  systems  have  received  considerable  interest  in 
response  to  an  ever-increasing  demand  for  better  utilization  of  the  available  resources  in  communications 
environments.  In  DS-CDMA  communication  systems,  the  target  operation  of  the  receiver  is  the  detection 
of  the  transmitted  information  bits  of  one  (mobile-end)  or  more  (base  station)  users. 

For  the  case  of  single-user  detection  in  DS-CDMA  systems,  the  optimum  decision  boundary  for  the  addi¬ 
tive  white  Gaussian  noise  (AWGN)  multiple- access  channel  is  nonlinear  [1],  As  such,  multi-layer  perceptron 
(MLP)  neural  networks  (NNs)  are  particularly  suitable  for  the  detection  of  DS-CDMA  signals  since  they  are 
capable  of  approximating  arbitrary  hypersurfaces  in  the  input  space.  MLPs  as  well  as  other  neural  network 
structures  for  single  or  multi-user  detection  were  considered  in  [2]- [11].  Independently  of  the  design  optimal¬ 
ity  criteria  that  are  used  to  determine  the  parameters  of  the  receiver  structure,  the  ultimate  performance 
measure  of  interest  is  the  probability  of  error  in  detecting  the  transmitted  information  bit  of  each  user  (also 
referred  to  as  bit-error-rate  or  BER).  This  is  exactly  the  motivation  for  this  present  work. 

In  recent  literature,  minimum  BER  (MBER)  optimization  in  the  context  of  linear  receiver  structures  was 
considered  in  [12]- [14]  where  stochastic  gradient  adaptive  algorithms  were  proposed  that  approximate  the 
linear  minimum  BER  single-user  DS-CDMA  receiver.  The  work  in  [14]  was  extended  in  [15]  to  a  stochastic 
gradient  adaptive  “least  error  rate”  algorithm  for  training  two-class  radial  basis  function  (RBF)  network 
classifiers.  A  Newton-type  recursive  minimum-BER  linear  detector  that  requires  knowledge  of  all  system 
users  and  noise  statistics  was  reported  in  [16]. 

Motivated  by  the  inherent  nonlinearity  of  the  single  user  optimum  decision  boundary  for  AWGN  multiple 
access  channels,  in  this  chapter  we  focus  on  MLP  neural  networks  and  their  potential  application  to  single- 
user  detection.  We  assume  a  synchronous  DS-CDMA  system  and  we  consider  the  problem  of  single-user 
detection  in  the  presence  of  unknown  multiuser  interference  and  additive  channel  noise.  Our  receiver  structure 
of  choice  consists  of  an  adaptive  MLP-NN  filter  front-end  followed  by  a  sign  detector.  There  are  three  key 
contributions  in  this  chapter  that  are  summarized  below,  (i)  The  NN  parameters  (weights  and  biases)  are 
adapted  online  in  a  way  that  minimizes  directly  the  induced  probability  of  error.  Although  an  analytic  closed- 
form  NN  filter  solution  for  the  adopted  minimum-probability-of-error  criterion  is  certainly  not  attainable  in 
general,  this  mathematical  intractability  is  bypassed  using  results  from  Stochastic  Approximation  theory 
that  allow  recursive  probability  of  error  minimization.  Relative  to  other  previously  proposed  interference 
suppression  approaches,  the  major  advantage  of  a  minimum  probability  of  error  scheme  is  the  incorporation 
of  the  performance  evaluation  measure  of  choice  directly  into  the  learning  process,  (ii)  Based  on  two  key 
symmetry  properties  exhibited  by  the  optimum  decision  boundary  of  the  user  of  interest  we  identify  two 
principal  characteristics  of  the  MLP  structure  under  consideration  and  translate  them  into  constraints  that 
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are  incorporated  in  the  MBER  design  procedure  of  the  network  weights  and  biases.  The  end  result  is  a 
significant  reduction  of  the  number  of  the  network  parameters  to  be  trained  which  implies  faster  network 
convergence.  We  note  that  the  NN  constraints  are  independent  of  the  specifics  of  MBER  optimization 
and  can  be  adopted  by  any  MLP  training  algorithm  to  accelerate  convergence  (e.g.  [2]- [1 1] ) .  (in)  We 

embed  importance  sampling  (IS)  principles  directly  into  the  receiver  optimization  process.  This  is  in  sharp 
contrast  to  the  traditional  use  of  IS  for  system  performance  evaluation  only.  In  addition  and  contrary  to  the 
traditional  IS  assumption  of  known  input  statistics,  the  proposed  scheme  does  not  require  such  knowledge. 
The  proposed  method  utilizes  only  a  short  pilot  sequence  and  in  effect  creates  virtual  high  BER  receiver 
conditions  that  accelerate  the  adaptation  (training)  process. 

The  rest  of  the  chapter  is  organized  as  follows.  In  Section  3.2  we  present  briefly  the  DS-CDMA  signal 
model  and  we  discuss  the  properties  of  the  optimum  single-user  decision  boundary.  Constrained  neural  net¬ 
work  optimization  and,  in  particular,  constrained  minimum  BER  optimization  are  the  subjects  of  Sections  3.3 
and  3.3,  respectively.  Importance-sampling-accelerated  MBER  neural  network  optimization  is  discussed  in 
Section  3.5.  Simulation  studies  that  illustrate  the  merits  of  the  proposed  scheme  are  included  in  Section  3.6. 
Finally  a  few  concluding  remarks  are  drawn  in  Section  3.7. 


3.2  Optimum  Single-User  Detection 


We  consider  a  A'-user  synchronous  DS-CDMA  communication  system  with  processing  gain  A.  Extension 
to  asynchronous  systems  is  straightforward1.  We  assume  that  each  user  is  assigned  a  normalized  signature 
vector  Sfc,  k  =  0,1,. . . ,  AM.  The  binary  antipodal  (+1  or  —1)  information  bits  are  assumed  to  be  independent, 
identically  distributed  and  equiprobable.  After  conventional  chip-matched  filtering  and  chip-rate  sampling, 
the  baseband  received  signal  vector  over  one  information  bit  period  takes  the  form 

K- 1 

r  =  y/Ekdk sk  +  n  (3.1) 

k= 0 

where  Ef.  and  <4  denote  the  bit  energy  and  the  information  bit  of  user  k,  respectively.  The  vector  n  represents 
AWGN  with  mean  0  and  auto  covariance  matrix  ct2Il  where  a2  is  the  channel  noise  variance  and  II  is  the 
Ax  A  identity  matrix. 

Using  the  total  probability  theorem  we  can  show  that  the  probability  density  function  (pdf)  /( r)  of  the 
random  vector  r  in  (3.1)  is  a  mixture  of  2K  multivariate  Gaussian  distributions,  i.e. 
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(3.2) 


where  is  the  bit  of  the  fcth  user  in  the  ith  possible  bit  combination,  i  =  1, . . . ,  2A  [1].  The  minimum 
BER  detector  of  the  user  of  interest  ( user  0)  decides  in  favor  of  the  information  bit  do  that  maximizes  the 
conditional  pdf  of  the  received  vector  r  given  that  do  was  transmitted  [1]: 

d0  =  arg  max  /rMo  (r).  (3.3) 

do£{±l} 

By  considering  the  particular  form  of  /( r)  in  (3.2)  we  may  rewrite  the  optimum  DS-CDMA  detector  of  (3.3) 
in  the  following  way: 
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JFor  the  case  of  one-shot  detection,  an  asynchronous  DS-CDMA  system  with  K  users  can  be  modeled  as  a 
synchronous  system  with  2K  —  1  virtual  users  [17]. 
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The  optimum  (minimum  BER)  single-user  detector  in  (3.4)  requires  knowledge  of  the  energy  terms  E±,  E%, 
. . .,  Ek-i  and  signatures  si,  S2,  . . . ,  s^-i  of  all  interfering  users  as  well  as  knowledge  of  the  noise  power  <r2 
(Eq  and  So  are  also  supposed  to  be  known).  The  computational  complexity  of  the  decision  rule  in  (3.4)  is, 
in  general,  exponential  in  the  number  of  users  K. 

As  a  general  comment,  the  optimum  single-user  detector  in  (3.4)  is  nonlinear  and  the  optimum  decision 
boundary  is  a  hypersurface  whose  characteristics  are  determined  by  all  user  signatures  and  energies  as  well 
as  the  additive  noise  power.  The  single-user  optimum  decision  boundary  does  have,  however,  some  proper¬ 
ties  that  are  not  system-specific  and  can  be  useful  in  the  design  of  MLP  neural  networks  that  attempt  to 
approximate  this  boundary  [18].  For  the  sake  of  completeness,  we  repeat  below  Part  (i)  of  Proposition  3.1 
in  [18]  which  will  serve  as  the  basis  for  the  developments  of  this  work. 


Proposition  3.1  Let  B  be  the  set  of  all  vectors  in  that  define  the  optimum  single-user  decision  boundary: 


B  =  {r  G  Rl  :  /r|d0=i(r)  =  /r|d0=-i(r)}. 


(3.5) 


Then, 

(i)  0  £  B  and 

(ii)  the  elements  of  B  are  pairwise  opposite,  i.e. 

r  G  B  if  and  only  if  —  r  e  B.  (3-6) 

□ 

Proposition  3.1  essentially  implies  that  the  optimum  decision  boundary  passes  through  the  origin  0  and 
is  also  symmetric  with  respect  to  the  origin.  This  property  is  independent  of  the  system  setup  and  can  be 
useful  in  practice  when  the  system  parameters  are  unknown  and  the  optimum  decision  boundary  has  to  be 
approximated  (estimated).  In  the  next  section,  we  exploit  this  property  in  the  form  of  a  set  of  constraints 
that  will  be  incorporated  in  the  receiver  adaptation  process. 


3.3  Constrained  NN  Optimization 

We  consider  a  3-layer  (input  layer,  hidden  layer,  and  output  layer)  feedforward  fully  connected  MLP-NN 
with  an  odd  number  of  hidden  neurons  and  a  single  output  neuron  followed  by  a  sign  detector  (Fig.  3.1). 

The  input  and  hidden  layers  consist  of  L  and  2 H  +  1  neurons,  respectively.  With  respect  to  the  hidden 
layer,  Wij  denotes  the  weight  associated  with  the  connection  between  the  jth  input  neuron  and  the  it h 
hidden  neuron,  j  =  1,  2,  . . . ,  L,  i  =  —H,  (-H- (-1) ,  . . .,  0,  1,  . . .,  H ,  while  bt  denotes  the  bias  associated  with 
the  ith  hidden  neuron,  i  =  —H,  (-H+ 1),  . . .,  0,  1,  . . .,  H.  With  respect  to  the  output  layer  (single  output 
neuron),  v,;  is  the  weight  associated  with  the  connection  between  the  ith  hidden  neuron  and  the  output 
neuron,  i  =  —H,  (-H+ 1),  . . .,  0,  1,  . . .,  H,  while  bout.  is  the  bias  of  the  output  neuron.  We  assume  that  all 
(hidden  and  output)  neurons  employ  the  same  (symmetric)  activation  function  ip  that  exhibits  the  following 
properties:  (i)  <p(x)  >  0  for  x  >  0  and  (ii)  tp(—x)  =  —<p( x)  for  any  x.  For  notational  simplicity  we  combine 
all  weights  associated  with  the  connections  between  all  input  neurons  and  a  single  hidden  neuron  i  to  form 
the  weight  vector  w,:  =  [u^i  Wi p.  . . .  Wi^]T ,  i  —  — H ,  (—17+1),  . . .,  0,  1,  . . . ,  H. 

Let  z  =  [w_Tw,  wTh+1,  . . . ,  w£,  b.H ,  b.H+ 1, .  ..,bH,  v.„,  v.H+i,  bout]  be  the  vectorized  form  of 

all  network  parameters.  Let  also  g( r;  z)  denote  the  (pre-detection)  output  of  the  NN  as  a  function  of  the 
network  input  r  and  parameterized  by  z.  Let  zopt  be  the  value  of  the  vector  z  that  minimizes  a  given  average 
(expected)  cost,  i.e.  zOJrt=argmin  E{C[g(r;  z)]}  (3-7) 

Z 

where  C  denotes  the  cost  as  a  function  of  the  NN  output  and  E  denotes  the  expectation  operation.  Examples 
of  average  cost  functions  include  the  probability  of  error  evaluated  at  the  output  of  the  sign  detector  that 
follows  the  NN  structure  or  the  mean  square  error  (MSE)  evaluated  at  the  output  of  the  NN  itself. 

The  optimization  problem  of  (3.7)  is  defined  over  a  hypersurface  parameterized  with  respect  to  2 H (L+2)+L-|-3 
parameters  in  all.  According  to  Proposition  3.1,  the  optimum  decision  boundary  is  symmetric  with  respect 
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to  the  origin  0  and  passes  through  it.  If  we  consider  these  characteristics  in  forming  the  NN  optimization 
problem,  then  we  can  reduce  significantly  the  size  of  the  parameter  vector. 

First,  we  require  the  hyperplanes  associated  with  the  hidden  neurons  to  be  pairwise  parallel.  Con¬ 
sequently,  we  require  that  the  neurons  of  each  pair  exhibit  antipodal  shifts  from  the  origin  0  (i.e.  same 
absolute  value  and  opposite  sign).  Let  us  denote  the  elements  of  such  an  arbitrary  pair  as  “ neuron  i”  and 
11  neuron  —i”  (i  ^  0).  Then,  the  network  decision  boundary  will  maintain  the  properties  of  the  optimum 
decision  boundary  identified  in  Proposition  3.1  if  the  neurons  of  each  pair  satisfy  the  following  constraints: 
(a)  w_j  =  w i  (the  hyperplanes  are  parallel);  ( b )  b-i  =  —bi  (the  hyperplanes  exhibit  opposite  shifts);  (c) 
b o  =  0  (the  middle  neuron  hyperplane  passes  through  the  origin);  (d)  v_*  =  v*,  i  ^  0  (the  output  weights  of 
the  same-pair  neurons  are  equal  to  maintain  overall  decision-boundary  symmetry) ;  (e)  bout  =  0  (the  neural- 
network  decision  boundary  passes  through  the  origin  0).  The  above  constraints  for  the  L — (2H+1)  —  1 
neural  network  single-user  DS-CDMA  receiver  are  summarized  in  Table  I. 

Taking  into  account  the  above  constraints  we  define  the  reduced  size  parameter  vector  z'  =  [wj  wf, 

. . .  ,w]j,  fq,...,  bH ,  Vo,  Vi,  . . . ,  vH]T .  Then,  we  propose  the  following  constrained  NN  optimization: 

z'opt  =  argmin  E{C[sr(r;  z')]}  (3.8) 

subject  to  the  constraints  of  Table  I. 

We  note  that  each  hidden  neuron  i  6  {-H,  -H  +  1,  . . . ,  H}  of  the  network  (or,  equivalently,  each  hidden 
neuron  weight-bias  pair  (w,,  bi))  determines  a  hyperplane  Vi  in  the  observation  space  that  is  orthogonal 
to  Wj.  The  shift  of  Vi  from  the  origin  0  equals  the  bias  bi.  The  2H+1  hyperplanes  V-H,  'P-H+ll  ■  ■  ■  >  Ph 
are  combined  through  the  output  weights  v-H ,  . . . ,  vH  and  bias  bout.  to  form  the  nonlinear  network 

decision  boundary.  The  optimization  problem  in  (3.8)  appears  in  its  most  general  form;  it  applies  to  any 
given  neural  network  g( r;z')  with  input  r  and  reduced  set  of  parameters  z '.  We  emphasize  that  the  con¬ 
straints  in  Table  I  can  be  used  in  conjunction  with  any  NN  training  method/criterion  that  may  be  used 
to  approximate  the  optimum  decision  boundary.  Incorporation  of  such  constraints  leads  to  significant  per¬ 
formance  enhancement  when  compared  to  unconstrained  training,  which  is  primarily  due  to  the  effective 
reduction  of  the  number  of  system  parameters  to  be  optimized.  For  an  L — (2H+1)  —  1  neural  network, 
this  number  is  reduced  from  2H(L+2)+L+3  parameters  when  unconstrained  optimization  is  pursued  to 
H(L+2)+L+l  when  constrained  optimization  of  the  type  of  (3.8)  is  invoked.  For  large  values  of  H  this 
reduction  amounts  to  nearly  a  50%  decrease  of  the  original  parameter  vector  size.  Additionally,  a  welcome 
side  effect  of  constrained  NN  optimization  is  reduced  risk  for  “redundancy”,  since,  for  example,  neuron  i  can 
no  longer  become  equivalent  to  neuron  —i  due  to  the  constraint  b-i  =  —bi. 

3.4  Constrained  Minimum  BER  NN  Training 

We  now  focus  on  the  training  procedure  for  solving  the  optimization  problem  in  (3.8)  when  the  average 
cost  function  is  the  probability  of  error  at  the  output  of  the  sign  detector  that  follows  the  neural  network 
structure.  We  recall  that  the  NN  receiver  structure  is  represented  by  the  function  g( r;  z'),  operates  on  input 
r,  and  is  parameterized  by  vector  z'.  The  final  bit  decision  do  is  the  sign  of  the  NN  output  g( r;  z'): 

do  =  sgn^rjz')].  (3.9) 

The  probability  of  error  at  the  final  output  of  the  receiver  is 

Pe  =  P[do  ±  d0]  =  7T0P[do  =  1 1  do  =  -1]  +  mP[do  =  -1 1  do  =  1]  (3.10) 

where  7To  =  P[do  =  —1]  and  =  P[do  =  1]  are  the  a  priori  probabilities  of  transmitting  —1  or  +1, 
respectively.  For  the  purposes  of  this  presentation,  we  assume  that  ttq  =  7rf  =  1/2.  To  emphasize  the 
dependency  that  the  induced  probability  of  error  has  on  the  parameter  vector  z',  we  will  denote  the  error 
probability  expression  as  Pe(z').  Our  goal  is  to  adapt  the  NN  weight  and  bias  parameter  vector  z'  such  that 
the  induced  probability  of  error  becomes  minimum.  We  will  refer  to  the  resulting  receiver  as  the  constrained 
minimum  bit-error-rate  NN  (MBER-NN)  detector. 

Let 

u(r,z')  =  sgn[5(r;z')]  (3.11) 
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be  the  output  of  the  receiver.  We  define  the  single-letter  distortion  measure  ((■)  as  follows 

C(r0,ri;z')  =  ^ {tt0 [1  +  w(r0,z')]  +  7n[l  -  u(ri,z')]}  (3.12) 

where  r0  and  rx  are  assumed  to  be  received  data  vectors  from  hypothesis  H0  (i.e.  with  do  =  —  1)  and 
hypothesis  Hi  (i.e.  with  do  =  +1),  respectively.  This  is  the  modification  for  the  antipodal  ±1  case  of  the 
distortion  measure  that  was  first  defined  in  [19].  Intuitively,  £(•)  measures  the  distortion  at  the  output  of 
the  receiver.  The  terms  [1  +  u(r0,z')]  and  [1  —  u(r1,z')]  are  both  zero  when  the  receiver  makes  the  correct 
decision  and  strictly  positive  otherwise.  The  two  terms  are  weighted  by  the  a  priori  probabilities  of  the 
two  hypotheses  Ho  and  Hi,  so  failure  of  the  receiver  to  detect  the  most  likely  hypothesis  will  result  in  a 
proportionally  large  penalty.  As  a  result,  the  expected  value  of  this  single-letter  distortion  measure  (  is  as 
follows  [19]: 


£[C(ro,ri;z')]  = 


E 


-[71-0(1  +  u(r0,z'))  +  tti(1  -  u(ri,z'))] 


7To-P[w(ro,  zl)  =  1]  +  7TiP[u(ri,  zl)  =  -1]  =  Pe{  zl). 


(3.13) 


Equation  (3.13)  allows  us  to  reduce  the  problem  of  finding  the  parameter  vector  zl  that  minimizes  the 
probability  of  error  Pe(z')  to  that  of  finding  the  parameter  vector  zl  that  minimizes  the  expected  value  of 

C(- ,;;*')■ 

Let  in  addition 


Xn(z')  =  [£„ii(z,),...,:r„iM'(z,)]T  (3.14) 

be  a  vector  of  the  same  length  as  z',  with  its  jth  element  xnj( z'),  j  =  1,  2,  ... ,  H(L+2)+L+l,  defined  by 

%n:j  (z  )  —  —  [C(®*0,n5^*l,n5Z  T  Cn&j )  C(^0,n;lTin,Z  CnS  j )]  (3.15) 


where  cn  =  cn~x!A  for  some  c  >  0  and  denotes  the  jth  coordinate  unit  vector.  The  sequences  {r0i„}  and 
{ri„},  n  =  1,  ...,1V,  are  assumed  to  be  pilot  (training)  sequences  of  received  vectors  with  do  =  —1  and 
do  =  +1,  respectively.  We  note  that  in  this  setup,  {ro,n}  and  {ri  n},  n  =  1, . . . ,  N,  consist  of  independent 
identically  distributed  vectors.  The  variance  of  all  components  of  xn(z')  is  finite  for  all  zl .  The  regression 
function  Pe{ zl)  is  twice  continuously  differentiable  with  bounded  second  derivative  at  least  as  long  as  n  in 
(3.1)  is  white  Gaussian  [19].  Finally,  the  above  selection  of  the  gain  sequence  {cn}  guarantees  that  cn  — »  0. 
Then,  for  a  monotonically  decreasing  sequence  of  positive  numbers  {an}  such  that  Y  an  =  oo,  Y  anCn  <  oo, 
and  Yancn  2  <  oo ,  the  recursion 


Zn+1  =zn  ~an^n(z'n)  U=1,...,N  (3.16) 

converges  with  probability  one  (w.p.  1)  to  the  parameter  vector  z',  say  9 ,  that  minimizes  Pe{ z')  provided 
that  for  every  e  >  0  there  exists  a  positive  number  J(e)  such  that 


||z'  —  0||  >  e  implies 

Pe( zl)  >  Pe(9)  and  ||VPe(z,)||  >  J(e). 


(3.17) 


The  foundation  of  the  recursive  algorithm  in  (3.16)  is  provided  by  [20]  which  is  an  extension  to  multivariate 
regressions  of  the  well-known  Kiefer- Wolfowitz  stochastic  approximation  method  [21]  for  finding  the  extrema 
of  a  regression  function.  At  every  stage  of  recursion  (3.16)  each  component  of  the  gradient  of  the  regression 
function  Pe{ z')  is  evaluated/estimated  simultaneously  by  the  two-sided  difference  approximation  in  (3.15). 
Common  selections  for  the  sequence  cn  in  (3.15)  and  the  learning  gain  an  in  (3.16)  are  cn  =  cn _1/4,  c  >  0, 
and  an  =  a/n,  a  >  0,  respectively.  These  selections,  however,  are  arbitrary  and  any  sequences  that  satisfy 
cn  — »  0,  an  — »  0,  Y  an  =  oo,  Y  anCn  <  oo,  Y  ancn2  <  °°  can  be  used. 
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3.5  Importance-Sampling-accelerated  MBER-NN  optimization 


The  constrained  MBER  training  algorithm  described  in  the  previous  section  (eq.  (3.16))  exhibits  slow 
convergence  as  the  NN  approaches  a  low  BER  operating  region.  This  is  caused  by  the  fact  that  z 'n  is 
updated  only  when  one  of  the  distortion  measures  C(ro,n;  ri,ni  z'  +cnej)  or  C(ro,n>  ri z '  —  cnej)  is  non- zero, 
i.e.  when  an  “error”  occurs.  This  event  may  become  extremely  rare  when  the  BER  of  the  NN  is  low  and 
the  value  of  cn  is  small.  To  overcome  this  limitation  we  modify  the  MBER  recursion  in  (3.16)  to  achieve  a 
higher  paranreter-update  rate  (and,  thus,  a  higher  convergence  rate)  by  creating  virtual  high  BER  receiver 
conditions.  This  is  accomplished  by  altering  the  received  vector  statistics  using  importance  sampling  (IS) 
principles.  We  note  that  importance  sampling  is  traditionally  used  for  system  performance  evaluation;  in 
contrast,  what  we  propose  here  for  the  first  time  is  to  use  IS  directly  for  receiver  design  optimization. 

We  recall  that  the  MBER  training  algorithm  described  in  the  previous  section  operates  on  data  vectors 
ro  and  ri  whose  conditional  pdfs  are 


and 


/o(r)  =  /(r|d0  =  -1)  = 
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1  1 


2*"1  Vi™  ^ 


^  1 

2^exP 


2<72 


K- 1 


r  +  V E0s0  +  \fEkd\. 


1  1 


2*-1  jri 


1  /  i 


k  =  1 


K- 1 


r  —  y/ E0s0  +  \fEkd\. 


fc=:l 


(3.18) 


(3.19) 


respectively. 

Following  the  principles  of  IS,  the  proposed  IS-based  constrained  MBER-NN  training  algorithm  operates 
on  vectors  yo  and  yi  drawn  from  distributions  ho  and  hi,  respectively,  that  are  different  than  fo  and  fi  in 
(3.18)  and  (3.19).  In  general,  the  densities  ho(-)  and  hi(-)  should  be  chosen  so  that  the  BER  of  the  receiver 
(operating  on  received  vectors  drawn  from  ho  and  hi)  is  high  enough  to  induce  plenty  of  “errors”  during  the 
training  process.  Thus,  the  choice  of  ho  and  hi  is  application  dependent.  For  the  problem  of  DS-CDMA 
receiver  design,  we  use  parametrically  described  densities  hotp(y)  and  hip(y)  parameterized  by  a  scalar 
P  G  (0, 1]  that  are  given  by 

2K~l 

''«<y)  =  g  “p 

and 

2K -1 

h,Ay)  =  K  “p 

In  other  words,  hotp(y)  and  hitp(y)  are  densities  of  virtual  received  vectors  where  the  desired  signal  is  atten¬ 
uated  by  a  factor  /3  relative  to  the  received  vectors  distributed  according  to  (3.18)  and  (3.19),  respectively. 
For  j3  =  0  we  have  complete  attenuation  of  the  desired  signal  while  for  /3  =  1  we  have  no  attenuation  at  all. 
The  advantage  of  the  specific  choices  for  ho,p  and  hitp  in  (3.20)  and  (3.21)  is  that  we  can  easily  draw  vectors 
from  this  distribution  by  subtracting  a  part  of  the  desired  signal  from  the  actual  received  vector  as  follows: 


Yo  —  ro  +  (1  —  P)\/Eoso  (3.22) 

yr  =  rx  -(1-/3)/EqSo.  (3.23) 

Using  fo,  hotp  and  fi,  hitp  we  define  the  following  weighting  functions  Wo(y)  and  Wi(y): 

Wo{y)  =  fo{y)/ho,p(y)  (3.24) 

Wi(  y)  =  fi(y)/hiip(y).  (3.25) 

Let  also  C,(yoiyi;z/)  be  a  new  modified  distortion  measure  with  equal  a  priori  probabilities  defined  as 

C'(yo,yi;z')  =  ^{Wo(yo)[l  +  sgn[5(yo;z,)]  +  Wi(yi)[l  -  sgn[g(yi;  z')]}  .  (3.26) 


1 

'2^2 


K- 1 

y  —  PV Eqsq  +  V Ekd\, 

k= 1 


k 


(3.21) 


1 

'2^2 


K- 1 


y  +  Py/EoSo  +  ’s/Ek^k 


k*k 
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(3.20) 
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Then,  it  is  straightforward  to  show  that 

£{C'(yo,  yi;  z')}  =  £{C(r0,  it;  z')}  =  Pe(  *')■  (3-27) 

In  other  words,  the  expected  value  of  the  distortion  measure  ('  is  equal  to  the  true  BER  of  the  NN  pa¬ 
rameterized  by  z'  when  the  latter  operates  on  received  vectors  r  distributed  according  to  /(•)  given  by 
(3.2). 

Proceeding  similarly  to  Section  IV,  let 

<(z 0  =  [<1(z%..,<M'(zT  (3-28) 

be  a  vector  with  its  jth  element  x'n  •( z'),  j  =  1,  2,  ,  H(L+2)+L+l,  defined  by 

x'n,j(z')  =  J—\C'(yO,n,yi,n,z'  +  Cnej)  -  C'  (yo,n,  yi.n !  z'  ~  CnGj)}.  (3.29) 

zcn 

In  (3.29),  {yo,n}  and  {yi,n},  n  =  1, ...  ,7V,  are  sequences  obtained  from  the  pilot  (training)  sequences  {ro,n }, 
{ri>n}  by  (3.22)  and  (3.23),  respectively.  Then,  from  (3.27)  we  see  that  for  sequences  of  positive  numbers 
an  and  cn  such  that  cn  — >  0,  an  — >  0,  ^  an  =  oo,  (7)  ancn  <  oo,  )7  o^c“2  <  oo  the  recursion 

zn+l  =Z'n~  dnX^zJJ,  Tl  =  1,  .  .  .  ,  TV  (3.30) 

converges  with  probability  one  (w.p.  1)  to  the  value  of  z'  that  minimizes  Pe(z').  In  comparison  with  (3.16), 
the  convergence  rate  of  the  recursion  in  (3.30)  is  much  improved  since  in  this  latter  case  the  network  during 
training  operates  always  in  a  virtual  high  BER  region  due  to  the  attenuated  desired  signal  in  the  training 
vectors  y0,„  and  yi,„. 

We  must  note  at  this  point  that  theoretically  the  weighting  functions  Wo(y)  and  W\(y)  in  (3.24),  (3.25) 
require  exact  knowledge  of  the  pdfs  hij g(-)  and  /,;(■),  i  =  0,1.  For  a  practical  solution,  we  propose  to 
approximate  the  above  pdfs  by  colored  Gaussian  densities  as  follows.  Let 

K- 1 

Ti+n  =  ^2  V^kdkSk  +  n  (3.31) 

k= 1 

be  the  interference-plus-noise  component  of  r.  The  vector  vI+N  is  distributed  according  to  a  multivariate 
Gaussian  mixture  of  2A_1  components.  We  approximate  this  mixture  by  a  multivariate  colored-Gaussian 
distribution  with  mean  Olxi  and  covariance  matrix  R/+jv  =  E{rI+NrJ+N}  =  Y^k= i  EkSkA  +  a2 1.  The 
approximated  pdfs,  that  we  distinguish  from  the  exact  hiA')  and  /*(•),  i  =  0, 1  by  using  the  symbol  ~,  are 
given  below: 

ho,p{y)  =  )  - yy^exp  (-i(y  +  /3v/^s0)TR7H!JV(y +  /3\/^so))  (3.32) 

2(27T)^'z  |Rj+jv|  V  / 

hiAy)  =  77777- - y7yexp(-i(y-/3V^so)TR7H!JV(y-/3/^s0))  (3.33) 

2(27r)Iy'2  |R/+jv|  k  / 

/o(r)  =  — — l— - yyyexp  (-i(r+ v/^s0)TR7+1JV(r+ /^S0))  (3.34) 

2(2tt)i>'2  |R/+jv|  k  7 

AM  =  77777- - y7yexp(-^(r-/^s0)TR7+1JV(r-/^s0)).  (3.35) 

z\z^)  '  |R/+jv|  k  *  / 

We  observe  that  hi  A  y),  *  =  0, 1  is  an  explicit  function  of  the  second  order  statistics  of  rI+N  which  are  easy 
to  estimate.  The  estimation  proceeds  as  follows.  First,  we  observe  that  E{do(n)rn}  =  y/Eoso-  Thus,  we 
may  estimate  the  product  y/Eoso  by 

-  1  T 

\/EqSq  =  —  ^  d0(n)rn  (3.36) 

n= 1 
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where  T  is  the  number  of  available  samples.  We  can  estimate  the  interference-plus-noise  component  of  the 
vectors  r„  by 

ri+N,n  =  r„  -  d0(n)\/E0s0.  (3.37) 

Then,  the  interference- plus- noise  covariance  matrix  estimate  can  be  evaluated  as  R I+N  = 

=  ^  ]T)n==i  r 'J+  N..nrJ+N  „.  Substituting  the  above  estimates  in  (3.33)  we  obtain  the  estimates 


ho,p(y) 

hi  Ay) 

fo  (r) 
A  (r) 


1 


1/2 

2(2ir)L/2 

R I+N 

1 

1/2 

2(2? t)l/2 

R/+JV 

l 

1/2 

2(2tt)l/2 

R/+jv 

1 

1/2 

2(2tt)l/2 

R/+jv 

exp 


exp 


exp 


exp 


2  (y  +  /3\/^so)TR/+1JV(y  +  /3\/^oso)^ 

^(y  -  /3v/^oso)TR-7+1jv(y  -  pAeqso)^ 

~2^r  V/^oso)T-^'-r+iv(r  +  V^o)) 
~2^r  ~  V/^oso)T^/+jv(r  _  V^qSq)^  • 


(3.38) 

(3.39) 


(3.40) 

(3.41) 


Finally,  an  estimate  of  the  two  weighting  functions  is  given  by 


W0(yn)  =  f  0(9n)/ho,p(yn) 

Wx( y„)  =  /  i(y„)/fti,/3(y„) 

where  _ 

9n  =  r„  -  d0(n)(l  -  P)y/Eos0. 


(3.42) 

(3.43) 

(3.44) 


3.6  Simulation  Studies 

We  consider  a  synchronous  DS-CDMA  system  populated  by  K  =  6  users,  each  with  a  length  L  =  16 
signature.  The  signature  cross-correlations  between  users  are  in  the  range  [0.125,0.375].  In  each  and  every 
study  below,  the  BER  results  that  we  present  are  averaged  over  100  independent  neural-network  receiver 
training  realizations  (each  realization  corresponds  to  a  single  epoch  consisting  of  4000  data). 

We  examine  two  feedforward  MLP  neural  network  structures  for  H  =  1  and  H  =  3,  i.e.  one  of  size  16 — 
3 — 1  (Fig.  3.2)  and  the  other  of  size  16 — 7 — 1  (Figs.  3. 3-3. 5).  The  MLP  NN  of  the  desired  user  is  trained  by 
either  the  plain  MBER  algorithm  (defined  similarly  to  (3.16)  with  z  in  place  of  z'),  or  the  constrained-MBER 
algorithm  in  (3.16),  or  the  IS-constrained-MBER  algorithm  in  (3.30).  For  each  case,  the  BER  performance 
is  shown  as  a  function  of  the  number  of  new  training  data  arriving  at  the  receiver.  As  a  comparative  study 
we  also  consider  the  standard  backpropagation  (BP)  training  algorithm2.  We  recall  that  the  BP  algorithm 
is  a  recursive  procedure  that  aims  at  the  minimization  of  E{[do  —  g(r;z)]2},  i.e.  minimizes  the  mean-square- 
error  (MSE)  between  the  output  of  the  MLP  g( r;z)  and  the  desired  signal  do-  According  to  stochastic 
approximation  principles,  the  BP  algorithm  originates  from  the  recursion 

z„+i  =  zn  -  [d0(n)  -  g( r„;  z„)]2  (3.45) 

for  any  monotonically  decreasing  sequence  of  positive  numbers  {/.in}n  such  that  =  00  an<^  < 

oo  ({r„}„  is  a  pilot  sequence  of  input  data  vectors). 

2The  results  in  this  work  apply  to  sy stems/ neural  network  structures  with  any  value  of  K,  L  and  H.  We  note, 
however,  that  higher  values  of  L  and  H  may  require  prohibitive  amount  of  time  for  evaluation  of  BER  performance 
of  any  neural  network  algorithm  through  Monte-Carlo  simulations. 
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Table  3.1.  Neural- Network  Symmetry  Constraints 


All  algorithms  are  initialized  with  the  same  parameter  values  drawn  randomly  and  uniformly  in  the 
interval  [—0.25,0.25].  For  Figs.  3.2  and  3.3,  the  energy  of  the  user  of  interest  is  set  at  14dB  while  the 
energies  of  the  interferers  range  from  8-lldB.  We  see  that  the  IS-constrained-MBER  algorithm  converges 
noticeably  faster  in  both  studies  of  Fig.  3.2  and  Fig.  3.3.  In  both  figures,  all  various  forms  of  the  MBER 
training  algorithm  outperform  the  standard  BP  training  algorithm.  In  Fig.  3.4,  we  repeat  the  same  study 
as  in  Fig.  3.3  where  now  the  user  signal  of  interest  is  at  lOdB  and  all  five  interferers  are  at  12dB.  In  this 
figure  we  also  include  the  performance  curve  of  the  MLP-NN  single  user  detection  algorithm  proposed  in 
[2].  We  note  that  the  algorithm  in  [2]  requires  noiseless  training  data  as  well  as  knowledge  of  the  interfering 
information  bits.  The  performance  of  the  network  when  trained  by  the  algorithm  in  [2]  is  sensitive  to  the 
number  of  hidden  neurons  of  the  structure  (it  has  been  reported  that  a  large  number  of  hidden  neurons  - 
exponential  in  the  number  of  users  -  is  necessary  to  achieve  acceptable  performance  levels  [4]).  Finally,  in 
Fig.  3.5  we  compare  the  performance  of  the  training  algorithms  of  Fig.  3.4  when  used  on  a  larger  MLP-NN 
(16 — 7 — 1)  with  smaller  user  population  ( K  =  3).  Again,  we  see  that  the  IS-constrained-MBER  algorithm 
converges  noticeably  faster. 

3.7  Conclusions 

Motivated  by  the  ultimate  performance  metric  of  a  digital  communications  link,  namely  BER,  a  fast  con¬ 
verging  adaptive  minimum  BER  MLP-NN-based  receiver  architecture  for  DS-CDMA  communications  was 
developed.  The  adaptive  receiver  incorporates  the  BER  optimization  criterion  directly  into  the  NN  learning 
process.  Furthermore,  properties  of  the  optimum  single-user  decision  boundary  for  AWGN  multiple-access 
channels  were  translated  into  a  set  of  constraints  that  were  incorporated  in  the  minimum  BER  training  pro¬ 
cedure,  thus,  significantly  reducing  the  number  of  network  parameters  (weights  and  biases)  to  be  optimized. 
Coupled  with  importance  sampling  (IS)  principles  embedded  directly  into  the  receiver  optimization  process, 
the  end  result  is  faster  network  convergence  and  lower  vulnerability  to  architectural  NN  redundancy.  The 
proposed  unconventional  use  of  IS  does  not  assume  knowledge  of  the  input  statistics  and  requires  only  a  short 
pilot  sequence  to  create  virtual  high  BER  receiver  conditions  that  accelerate  the  NN  parameter  optimization. 

Simulation  results  supported  these  theoretical  developments  and  illustrated  the  merits  of  the  minimum 
BER  training  approach  as  well  as  the  significant  convergence  rate  gains  obtained  as  a  result  of  the  proposed 
IS-constrained  method. 


63 


Fig.  3.1.  Neural  network  receiver  archite 


64 


Fig.  3.2.  BER  performance  versus  number  of  training  data  for  a  16-3-1  MLP  NN  (L=16,  H= 1) 
trained  by  either  the  standard  BP  algorithm  in  (3.45),  or  the  plain  MBER  algorithm  (defined 
similarly  to  (3.16)  with  z  in  place  of  z'),  or  the  constrained-MBER  algorithm  in  (3.16),  or  the 
IS-constrained-MBER  algorithm  in  (3.30). 


Fig.  3.3.  BER  performance  versus  number  of  training  data  for  a  16-7-1  MLP  NN  (L=16,  H= 3) 
trained  by  either  the  standard  BP  algorithm  in  (3.45),  or  the  plain  MBER  algorithm  (defined 
similarly  to  (3.16)  with  z  in  place  of  z'),  or  the  constrained-MBER  algorithm  in  (3.16),  or  the 
IS-constrained-MBER  algorithm  in  (3.30). 
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Fig.  3.4.  Same  study  as  in  Fig.  3.3  with  the  user  signal  of  interest  at  lOdB  and  the  five  interferers  at 
12dB.  The  16-7-1  MLP-NN  (L=16,  H= 3)  is  trained  by  either  the  standard  BP  algorithm  in  (3.45), 
or  the  noiseless  training  scheme  in  [2],  or  the  plain  MBER  algorithm  (defined  similarly  to  (3.16) 
with  z  in  place  of  z;),  or  the  constrained-MBER  algorithm  in  (3.16),  or  the  IS-constrained-MBER 
algorithm  in  (3.30). 
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Fig.  3.5.  BER  performance  versus  number  of  training  data  for  a  16-7-1  MLP  NN  (L=16,  H= 3) 
with  K  =  3  (with  the  user  signal  of  interest  at  lOdB  and  the  two  interferers  at  12dB).  The  neural 
network  is  trained  by  either  the  standard  BP  algorithm  in  (3.45),  or  the  noiseless  training  scheme 
in  [2],  or  the  plain  MBER  algorithm  (defined  similarly  to  (3.16)  with  z  in  place  of  z'),  or  the 
constrained-MBER  algorithm  in  (3.16),  or  the  IS-constrained-MBER  algorithm  in  (3.30). 
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Chapter  4 


On  the  Relative  Output  SINR  of  Full 
and  Partial  Decorrelators 


4.1  Introduction 

Linear  DS/CDMA  multiuser  detectors  have  emerged  as  low  complexity  alternatives  to  the  optimum  multiuser 
detector  [1],  Within  the  class  of  linear  detectors,  minimum- mean-square-error  (MMSE)  and  decorrelating- 
type  receivers  strike  an  appealing  balance  between  complexity  and  performance.1  In  this  work,  we  focus 
solely  on  linear  decorrelating-type  receivers  [3]- [8]. 

In  the  presence  of  additive  white  Gaussian  noise  (AWGN),  full  decorrelation  of  the  multiple  access 
interference  (MAI)  results  in  enhanced  noise  variance  at  the  decorrelator  output.  Partial  decorrelation 
occasionally  strikes  a  better  balance  between  interference  suppression  and  noise  enhancement  as  it  was 
observed  in  [9].  A  partial  decorrelator  (PDEC)  chooses  purposefully  to  decorrelate  only  a  part  of  the  MAI 
as  opposed  to  the  full  decorrelator  (FDEC)  that  decorrelates  completely  the  MAI.2 

In  this  work,  motivated  primarily  by  scientific  curiosity  rather  than  searching  for  practical  solutions,  we 
identify  necessary  and  sufficient  conditions  on  the  system  parameters  under  which  a  PDEC  outperforms  the 
FDEC  in  the  output  SINR  sense.  Our  developments  provide  further  insight  into  the  operational  charac¬ 
teristics  and  limitations  of  decorrelating  receivers  and  show  why  decorrelation  of  MAI  that  meets  certain 
conditions  should  be  purposefully  avoided,  if  at  all  possible.  As  a  side  study,  we  consider  a  blind  implemen¬ 
tation  of  the  FDEC  that  is  based  on  eigendecomposition  of  the  interference-plus-noise  autocovariance  matrix 
and  can  be  readily  modified  to  provide  a  PDEC  as  explained  in  the  sequel. 


4.2  Signal  Model 

We  consider  a  A'-user  synchronous3  DS/CDMA  communication  system  that  utilizes  short  spreading  codes 
of  length  L.  In  this  system,  user-A,  A:  =  0, . . . ,  K  —  1,  is  assigned  a  normalized  signature  vector  s where 
s k[l]  =  {±l/\/A},  l  =  1,  2,  •  •  •  ,  L.  The  signature  matrix  is  denoted  by  SLxK  =  [so  s,  •  •  •  s^]  and  is 
assumed  to  be  full  rank  (rank  AT).  In  the  rest  of  the  work,  we  focus  on  single-user  detection  and  without 
loss  of  generality  we  assume  that  the  user  of  interest  is  user  0.  After  chip-matched  filtering  and  chip-rate 

1In  extreme,  non-practical  interference  models  the  matched-filter  (MF)  scheme  may  outperform  the  decorrelating 
or  MMSE  detector  in  terms  of  bit-error-rate  (BER)  [2]. 

2This  is  in  contrast  to  unintentional  operation  under  partial  decorrelating  conditions  when  unknown  interference 
is  present  in  the  system  (e.g.  inter-cell  interference  [10]). 

3  Generalization  to  asynchronous  transmissions  over  multipath  channels  is  straightforward  if  we  formulate  the 
asynchronous  channel  as  a  synchronous  one  with  a  higher  virtual  active  user  population  and  perform  “one-shot” 
decorrelation  [1]. 
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sampling,  the  baseband  received  signal  vector  over  one  bit  period  takes  the  form 


K-l 

r  =  \f%bk sfc  +  n  =  t/e^oSq  +  i  +  n,  (4.1) 

k= o 

where  Ef.  and  bk  denote  the  bit  energy  and  the  information  bit  of  user-fc,  respectively,  (the  binary  information 
bits  are  assumed  to  take  the  values  ±1  independently  and  equiprobably),  i  =  Yk^X  V^kbksk  identifies 
comprehensively  the  MAI,  and  n  represents  the  AWGN  vector  with  mean  0  and  autocovariance  matrix 
cr2IL,  where  IL  is  the  L  x  L  identity  matrix. 


4.3  Full  Decorrelator  (FDEC) 

Under  the  assumption  of  perfectly  known  signature  matrix  S,  the  FDEC  for  user  0  is4 

ds  =  SX-4ei  (4.2) 

where  X  =  STS  is  the  signature  cross-correlation  matrix  and  ei  =  [10  •  •  •  0]T.  We  note  that  with  simple 
algebraic  manipulations  we  can  show  that  ds  in  (4.2)  is  a  scalar  multiple  of  the  filter  (1^  —  S\oS+0)s0  where 

S\o  =  [si, . .  ,■ , and  the  superscript  +  denotes  the  Moore-Penrose  generalized  inverse  [1]. 

The  FDEC  can  be  also  formed  by  means  of  ED  of  the  desired-signal-present  received-vector  autoco¬ 
variance  matrix  R  =  E{ rrT}  =  Yk=o  Ek.Sk.sJ,'  +  er2IL,  as  follows.  Let  the  ED  of  R  be  R  =  QAQT, 
where  A  =  diag(Xo,  Ai,  ■  ■  •  ,  A^.i),  Ao  >  •••  >  XK-i  >  XK  =  ■  ■  ■  =  XL_ x  =  u2,  are  the  eigenvalues  of  R 
in  descending  order,  and  Q  =  [qo, . . . ,  qt_rj  denotes  the  matrix  of  the  corresponding  eigenvectors.  Let 
Qs  =  [qo  qi  •  •  •  QK-l]  5  where  q0,qi,---  ,qA--i  span  the  sT/QuciI  subspuce^  and  A.^  — 

Then,  the  FDEC  for  user  0  takes  the  form: 


d 


EDI 


Qs(As-<t2Ia)  'Cfts* 


K-l 


E 


wql 

Xk  -  cr2 


s0- 


(4.3) 


The  FDEC  dED1  in  (4.3)  was  proposed  in  [7]. 

In  the  following,  we  derive  an  alternative  subspace-type  FDEC,  dED2,  that  instead  of  R,  utilizes  the 
desired-signal- absent  (interference-plus- noise  only)  received- vector  autocovariance  matrix,  RJ+JV  =  ^{(i  +  n) 
(i  +  n)T}  =  R  -  E0soSt.  Let  the  ED  of  R/+JV  be  R/+JV  =  UDUT  =  [U,  U^]D[U,  U^]T,  where  D  = 
diag(di,  •  •  •  ,  dL)  denotes  the  diagonal  eigenvalue  matrix,  and  =  [ux  u2  •  •  •  u,^]  (i.e.  the  columns  of 

U/  span  the  MAI-subspace) .  The  filter  dED2  is  given  by 

dED2  =  (I  -  U,Uf)  s0,  (4.4) 


i.e.  dED2  is  the  component  of  So  that  is  orthogonal  to  the  MAI  subspace.  We  note  that  the  decorrelating 
filter  dED2  is  a  CDMA  version  of  the  “eigencanceler”  [11]-[13] .  Under  perfect  knowledge  of  S,  R,  and 
R/+JV,  FDECs  ds,  dED1,  and  dED2  are  all  equivalent5  (scalar  multiples  of  each  other)  which  implies  that  the 
corresponding  output  SINRs  7S,  7BDi,  and  'yED2  are  equal  and  given  by 


(dJso)2£o  Eo 
\\ds\\2a2  c2  {X-4 


(4.5) 


where  {X  1  identifies  the  (i,  j)th  element  of  matrix  X  1. 

4The  bold  subscript  “s”  of  vector  d  is  used  to  distinguish  the  signature-matrix-based  (S-type)  decorrelator  ds 
from  the  eigendecomposition  (ED)-based  structures  (identified  by  the  subscript  “ed”)  that  will  be  presented  later  in 
this  section.  To  avoid  any  confusion,  we  clarify  that  the  non-bold  subscript  “s”  of  a  matrix  will  indicate  operation 
in  the  complete  signal  subspace  while  “s-1-”  will  indicate  operation  in  the  corresponding  orthogonal  subspace. 

5Any  filter  w  that  satisfies  the  constraints  (i)  wTs0  >  0,  (ii)  wTSfc  =0,  k  =  1,  •  •  •  ,  K  —  1,  and  (Hi)  w  lies  in  the 
range  of  S,  yields  the  same  bit  decisions  as  ds. 
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However,  when  R  and  RJ+JV  are  unknown  and  are  sample- average  estimated  the  corresponding  data- 
adaptive  FDECs  differ  and  so  do  their  SINR  and  BER  performance.  The  sample- aver  age  estimate  of  R  is 
R  =  jj  J2m= i  rmrm  where  M  is  the  size  of  the  available  data  record.  R/+iv  can  be  estimated  similarly 

during  a  silent  period  of  the  user  of  interest.6  By  performing  eigenvalue  decomposition  on  R  and  Rj+tv  we 

obtain  the  estimates  of  dED1  and  dED2: 

dEDi  =  QS(AS  -  ff2IJf)_1Qjs0  (4.6) 

dED2  =  so  —  UjUj  So-  (4-7) 

Comparing  expression  (4.6)  and  (4.7)  we  observe  that  the  estimate  dED2  is  a  function  of  the  eigenvector 
estimates  U,  only,  while  dED1  is  a  function  of  both  the  eigenvector  estimates  Qs  and  the  eigenvalue  estimates 
As.  Most  importantly,  evaluation  of  (4.6)  requires  inversion  of  the  modified  eigenvalue  matrix  (As  —  a2IK), 
an  operation  that  yields  increased  estimation  variance  particularly  when  estimation  is  based  on  a  short  data 
record.  Therefore,  it  is  expected  that  dED2  outperforms  dEr>1  when  the  number  of  available  data  is  limited 
(and  the  same  is  true  with  the  corresponding  PDECs  developed  in  the  next  section). 

4.4  Partial  Decorrelator  (PDEC) 

While  the  FDEC  decorrelates  the  complete  MAI,  a  PDEC  aims  at  decorrelating  only  a  part  of  it  by  excluding 
one  or  more  user-signatures  or  eigenvectors  from  the  corresponding  implementation  method.  In  this  section, 
we  investigate  the  conditions  under  which  the  PDEC  outperforms  the  FDEC  in  the  output  SINR  sense. 

Let  dSXj,  dEr,1V7-,  and  d ED2\j  denote  the  PDEC  for  user  0  that  decorrelates  all  interfering  users  but 
user  j,  j  =  1, . . . ,  K  —  1,  eigenvector-direction  j  of  R,  j  =  0, . . . ,  I\  —  1,  or  eigenvector-direction  j  of  R/+JV, 
j  =  1, . . . ,  K  —  1,  respectively7. Similarly,  we  define 

—  [so  •  •  •  •  •  •  sE_1], 

X\i  =  S^S.,,  ^  £  S^Sj, 

Q s\j  =  [Ro  ■  ■  ■  Rj-i  Rj+i  ‘  ‘  ‘  Qa'-i]i 
A S\j  =  diag[Ao  •  •  •  Aj_i  Ay _)_ i  •  •  •  Aa-_x], 

U I\j  =  [ui  •  •  •  Uj_!  um  ■  ■  ■  UK_±]. 

Then,  the  PDECs  that  correspond  to  the  FDECs  presented  in  the  previous  section,  as  well  as  their  output 
SINRs  are  given  below. 


d  s\j 
7s  \j 


d 
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S\,x-/ei, 

(d^so)^o 

1 1  dS\y  1 1 2cr2  +  Ej(d^\jSj)2 


En 


a  2{X-1}1,1+E,(efX-/^.)2’ 

Q S\j  (A s\j  —  Qs\is0 

T 

J  qlql 

QfiDl  y  OS0, 

Xj  -  O-2 


(4.8) 


(4.9) 


(4.10) 


6 Alternatively,  R/+jv  can  be  estimated  in  a  decision-directed  way,  Rj+jv  =  X)m=i  [r™  —  6o(m)\/rBosoj 

|^rm  —  bo(m)  V^BoSoj  •  The  bit  estimates  &o(m)  may  initially  be  the  outputs  of  the  matched  filter  detector,  then 
switched  to  the  outputs  of  the  decorrelator  detector  when  enough  samples  are  accumulated.  A  simple  estimator  of 

the  product  V^BoSo  is  VfSoSo  =  Z)m=i^o(m)rm  [14]-[15]. 

7The  construction  procedures  and  conditions  under  which  PDECs  outperform  the  corresponding  FDECs  described 
in  this  section  can  be  generalized  in  a  straightforward  manner  to  the  case  where  more  than  one  signature  or  eigenvector 
direction  is  eliminated. 
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1edi\j 

^ED2\j 


Kdi\jSo)2e0 

d^RdED^-(d^s0)2Eo’ 
s0  -  UAiUfxjSo  =  dEE2  +  Uj-uJsq, 


r)ED2\j 


(dL2Xjs0)^p 

||dED2Xi||2(72  +  ( dj  -  (72)(sJUi)2 

Ep  [||U^s0||2  +  (s^.)2]2 

a2  1 1 U^l s0||2  +  dj/a2(sQ\ij)2  ' 


(4.11) 

(4.12) 


(4.13) 


The  following  proposition  identifies  conditions  on  the  interference  SNR  and  the  signature  cross-correlations 
under  which  the  PDEC  outperforms  the  FDEC  in  terms  of  output  SINR. 


Proposition  4.1  (i)  7sxj  >  7S  if  and  only  if  ^  <  (l  -  ifj ,  (ii)  7EDiXj  >  7EEi  if  and  only 


if 


lEDl 

1+7ED1 


>  Eo 


2(AJ-g2) 


(Ufc  so)2 
2-1  A  ,,—a2 


(qTso)2 


k= o 


(ufso)2 


(iii)  lED2\j  >  1ED2  if  and  only  if  <  2  +  ||XJ^So||2 


Proof:  (i)  Let  S  =  [Sxj  Sj]  and  X  =  STS.  There  exists  a  permutation  matrix  PKxk  such  that  S  =  SP, 
thus,  X  =  (SP)T(SP)  =  PTXP  and  X"1  =  PTX1P.  In  addition,  {XT1}!,!  =  {X"1}!,!, 
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* 3 
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X, 
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3  \3 


i-^x-v;-1 


bT 

’ 3  ^\3  'f'O 


(4.14) 


and 


[^3-^3% 

Then,  (4-5)  and  (4-9)  lead  to  the  following  expression: 

1  1 


=  xr/  +  x-i/fi  -  if  x-.V,]. 


Eo  . 

,7  s\j  7s. 

=  k2{X-1}i.i+^(efX-1S^.s,)2]  -^{X-1},,! 

=  E,(e^X-V,)2  -  a2  [{X  {X 


=  (efX-/^-)2  S,-a2/(l-^X-/^.) 


(4.15) 


(4.16) 


Therefore,  7SXJ>7S  i/  and  only  i/  <  (l  -  xfJX^xf^ 


(ii)  Instead  of  a  direct  comparison  of  'yED1  and  'yEDi\j,  we  evaluate  (and  compare)  the  function  /( 7)  = 
/or  7  =  7Br,i  and  7  =  7EoiXj  (we  note  that  /( 7)  is  monotonically  increasing  for  7  >  OJ. 
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(4.17) 


filEDl)  >  Eq 
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(iii)  Tfce  on/pn/  STAR  of  dED2  is  7 BD2  =  ( J°  =  f8-||U^s0||2.  ITien, 
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/  T  \2 

where  d  is  a  positive  scalar.  Therefore,  whenever  ^  <  2  +  we  have  7 BD2  <  ')ED2\j- 


4.5  Simulation  Results 

We  consider  a  DS/CDMA  system  with  A'  =  10  active  users  and  processing  gain  L  =  31.  First,  we  examine 
the  synchronous  single-path  case.  The  signature  cross-correlation  between  the  interfering  users  and  user  0 
is  in  the  range  (0.03,  0.29).  The  input  SNR  of  user  0  is  fixed  at  12  dB  while  the  input  SNR  of  the  interferers 
are  fixed  between  8  and  14  dB. 

Fig.  4.1(a)  shows  the  output  SINR  of  the  ideal  PDECs  as  a  function  of  the  index  of  the  signature  or 
eigenvector  that  is  eliminated  from  the  FDEC  structure  when  forming  the  corresponding  PDEC.  The  output 
SINR  of  the  ideal  MF  and  FDEC  are  also  included  as  reference  points.  Fig.  4.1(b)  shows  the  corresponding 
BER  values.  We  see  that  for  this  specific  system  setup,  the  PDEC  that  ignores  the  signature  or  eigenvector 
with  index  j= 9  has  higher  SINR  and  lower  BER  than  the  FDEC. 

Fig.  4.2(a)  shows  the  output  SINR  of  estimated  full  and  partial  (j=9)  decorrelators  as  a  function  of  the 
data  record  size;  Fig.  4.2(b)  presents  the  corresponding  BER  values.  The  output  SINR  and  BER  values  are 
averages  over  400  independent  decorrelator  estimates  (data  record  realizations).  R/+JV  is  estimated  during 
a  silent  period  of  the  user  of  interest.  For  comparison  purposes,  we  also  included  in  our  study  the  blind 
decorrelating  algorithm  of  [8]. 

In  Fig.  4.3,  we  assume  that  each  user  experiences  a  3-path  Rayleigh  fading  channel.  The  propagation 
delays  of  the  users  are  independent  and  uniformly  distributed  over  a  bit  interval.  We  assume  that  the 
receiver  is  synchronized  with  respect  to  the  first  path  of  the  user  of  interest.  To  illustrate  acceptable  BER 
levels,  we  consider  a  3  dB  total  input  SNR  increase  for  all  users  in  comparison  with  the  corresponding  SNR 
values  used  in  Figs.  1  and  2.  We  evaluate  the  output  SINR  and  BER  performance  of  the  estimated  FDECs, 
the  blind  decorrelating  algorithm  of  [8],  as  well  as  the  performance  of  the  best  PDEC  of  type  EDI  and 
ED2.8  The  results  that  we  present  are  averages  over  10000  runs  (1000  independent  asynchronous  multi-path 
fading  channel  realizations  and  10  independent  received  data  record  generations  per  channel).  Figs.  2  and 
3  demonstrate  that  in  environments  where  the  data  support  that  is  available  for  adaptation  and  redesign  is 
limited,  ED2-type  decorrelators  outperform  ED  1-type,  both  under  full  and  partial  decorrelation. 

8 The  best  PDEC  is  the  PDEC  that  exhibits  maximum  output  SINR  and  is  found  by  exhaustive  search  over  all 
PDECs.  The  SINR  of  each  candidate  filter  is  evaluated  using  the  ideal  autocorrelation  matrix  Ri+Jv.  As  a  result, 
the  SINR  (BER)  performance  curves  of  Figs.  2-3  represent  upper  (lower)  bounds  of  the  SINR  (BER)  performance 
that  would  be  attained  if  PDEC  selection  were  based  on  SINR  estimates.  Although  Proposition  4.1  can  form  the 
basis  for  the  development  of  efficient  PDEC  selection  methods,  such  investigation  is  beyond  the  scope  of  this  work. 
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index  j 

Fig.  4.1.  Output  SINR  and  BER  of  PDECs  as  a  function  of  the  index  of  the  signature  or  eigenvector 
that  is  eliminated  from  the  corresponding  FDEC  (synchronous  system). 

4.6  Conclusions 

In  this  work,  we  investigated  the  relative  output  SINR  performance  of  full  and  partial  decorrelators  and  we 
derived  necessary  and  sufficient  conditions  under  which  the  latter  outperform  the  former  in  the  output  SINR 
sense.  We  showed  that  decorrelation  of  MAI  that  satisfies  certain  conditions  should  be  purposefully  avoided. 
As  a  side  study  we  considered  a  blind  implementation  of  the  decorrelator  that  is  based  on  eigendecomposition 
of  the  desired-signal-absent  autocovariance  matrix.  When  estimation  of  the  decorrelator  is  based  on  limited 
data  support,  the  latter  implementation  exhibits  significantly  better  SINR  and  BER  performance  than  the 
decorrelator  that  is  based  on  eigendecomposition  of  the  desired-signal-present  autocovariance  matrix. 
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(a) 


Fig.  4.2.  SINR  and  BER  of  estimated  FDECs  and  PDECs  as  a  function  of  the  data  record  size 
(synchronous  system). 


(a) 


Fig.  4.3.  SINR  and  BER  of  estimated  FDECs  and  PDECs  (best  single  eigenvector  elimination  per 
realization)  as  a  function  of  the  data  record  size  (asynchronous  multipath- fading  channel). 
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Chapter  5 


All-Blind  Adaptive  Non-linear 
Spread- Spectrum  Receivers 

5.1  Introduction 

Most  of  the  research  chapter  on  spread-spectrum  (SS)  signal  detection  in  the  presence  of  multiple  access 
interference  (MAI)  and  additive  noise  is  based  on  the  additive  Gaussian  noise  assumption.  Although  it  is 
often  inaccurate  to  model  the  underlying  noise  of  many  physical  channels  as  Gaussian  [1] ,  such  an  assumption 
is  favored  primarily  because  of  its  mathematical  simplicity  and  tractability.  It  is  well  understood  that  the 
presence  of  non-Gaussian  noise  can  degrade  severely  the  performance  of  conventional  signal  processors  which 
are  based  on  the  Gaussian  assumption.  In  contrast,  when  the  non-Gaussian  nature  of  a  physical  channel 
is  correctly  modeled  and  accounted  for  during  receiver  design,  the  performance  of  the  receiver  may  benefit 
significantly  since  the  Gaussian  noise  is  the  most  entropic  among  all  noise  models  with  full  real/complex 
support  probability  density  function  (pdf)  and  finite  variance. 

Signal  detection  in  non-Gaussian  (impulsive)  noise  has  attracted  research  interest  since  the  early  1960’s 
(see  for  example  the  list  of  references  in  [2]).  In  particular,  SS  signal  detection  in  the  presence  of  non-Gaussian 
noise  was  studied  in  [2] -[6].  In  [2],  the  analysis  of  an  approximately  optimum  single-user  receiver  motivated  a 
general  adaptive  receiver  structure  that  is  composed  of  an  adaptive  front-end  chip-based  nonlinear  processor 
followed  by  an  adaptive  post-processing  linear  filter.  This  general  structure  combines  the  merits  of  both 
linear  and  nonlinear  signal  processing;  the  nonlinear  front-processor  suppresses  the  impulsive  noise  and  the 
adaptive  linear  post-processor  suppresses  the  MAI.  The  chapter  in  [2]  was  further  extended  in  [6]  where 
the  emphasis  of  the  problem  under  consideration  (SS  signal  detection  in  the  presence  of  MAI  and  additive 
non-Gaussian  noise)  was  shifted  to  short-data-record  optimized  receiver  design.  While  in  both  [2]  and  [6] 
the  parameters  of  the  front-end  nonlinear  processor  were  optimized  through  supervised  training,  in  this 
chapter  we  propose  new  blind  parameter  optimization  criteria  and  we  develop  an  all-blind  robust  adaptive 
SS  receiver. 


5.2  Signal  Model 

For  simplicity  in  presentation,  we  assume  a  synchronous  DS-CDMA  communications  system.  The  develop¬ 
ments  in  this  chapter  can  be  extended  in  a  straightforward  manner  to  accommodate  asynchronous  multipath 
fading  transmissions  as  well.  Let  L  and  K  denote  the  system  processing  gain  and  the  number  of  active 
users  in  the  system,  respectively.  We  assume  that  each  user  is  assigned  a  normalized  signature  vector  S/.-, 
k  =  0, 1,  -  -  - ,  K  —  1,  where  S  *,[/]  =  {±1  /VL},  l  =  1,2,  •••  ,L.  The  binary  antipodal  (+1,-1)  information 
bits  are  assumed  to  be  independent  identically  distributed  and  equally  probable.  After  conventional  chip- 
matched  filtering  and  chip-rate  sampling  over  one  information  bit  period  T,  the  baseband  received  signal 
vector  takes  the  form 

K- 1 

r  =  \J. FJkbkSk  +  n  =  y^fepSo  +  I  +  n  (5.1) 

k= 0 
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where  Ek  and  bk  denote  the  power  and  the  information  bit  of  user  k,  I  =  identifies  MAI  and 

n  represents  additive  (non-Gaussian)  noise.  Our  goal  is  to  detect  the  information  bit  of  the  user  of  interest, 
say  user  0. 


5.3  Receiver  Architecture 

For  the  signal  detection  problem  under  consideration,  the  general  receiver  structure  is  shown  in  Fig.  5.1(a) 
[2],  [6].  The  receiver  is  composed  of  a  front-end  chip-based  nonlinear  pre-processor  that  is  followed  by  a 
linear  post-processing  tap- weight  filter. 


(a) 


Fig.  5.1.  (a)  General  receiver  structure,  (b)  Hampel  nonlinearity. 


The  information  bit  decision  is  bo  =  sgn[z],  where  z  =  wTg(r;  c)  is  the  linear  tap- weight  filter  output, 
superscript  T  denotes  transpose,  and  sgn[-]  is  the  sign  operator.  The  nonlinear  pre-processor,  g(r;  c)  = 
[g(r-[ ;  c)  •  •  •  g(ri,\  c)]T,  considered  in  this  chapter  is  of  Hampel-type: 

g(x;  c)  =  g(x;  ci,  c2,  c3) 

'  x, 

A  Cl  |.C|  > 

—  <  C3-P|  x_ 

C3-C2  C1  |*|  ’ 

0, 

where  c  =  [ci,  C2,  c^]T  is  the  cutoff  parameter  vector  to  be  obtained  adaptively  (0  <  c\  <  C2  <  C3).  In  (5.2),  x 
is  a  complex  number  and  \x\  denotes  the  magnitude  of  x.  The  Hampel  nonlinearity  preserves  the  received 
measurements  whose  magnitude  is  less  than  the  threshold  value  c\  and  either  de-emphasizes  or  discards 
higher  magnitude  measurements  when  c\  <  |a;j  <03  or  |a:|  >03,  respectively.  The  real  valued  version  of  (5.2) 
is  shown  in  Fig.  5.1(b).  We  note  that  if  the  cutoff  parameters  are  such  that  0  <  ci  =  C2  =  C3,  then  the  Hampel 
nonlinear  processor  degenerates  to  the  puncher.  As  C2  — »  00  the  Hampel  operator  converges  to  the  clipper. 
With  respect  to  the  linear  filter  post-processor  w,  we  suggest  as  candidates  and  we  investigate  both  the 
minimum-variance-distortionless-response  (MVDR)  and  the  auxiliary-vector  (AV)  filters.  We  recall  that  the 
MVDR  filter,  w MVDR,  that  is  distortionless  in  the  vector  direction  of  the  user  signal  of  interest  So  is  given 


if  |a;|  <ci 
if  ci  <  |a;|  <C2 
if  c2  <  |z|  <c3 
otherwise 


(5.2) 
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by 


(5.3) 


R-c  So 
W  mvdr  —  gTt}-1c 
Sg  ±1C  &0 

where  Rc  =  E{g(r;  c)gT(r;  c)}  is  the  autocovariance  matrix  of  the  Hampel  nonlinearly  processed  input 
data  (£?{•}  denotes  the  statistical  expectation  operation).  On  the  other  hand,  the  AV  algorithm  generates  a 
sequence  of  AV  filters,  w^,  n  =  0, 1,  •  •  • ,  that  are  distortionless  in  the  Sg  direction  as  follows: 


w(0) 

WAV 

=  Sg. 

(5.4) 

For  n 

=  1.2,--- 

G(n) 

_  -p  ,,,(n— !)  cT p  (n— 1) Q 

—  -tlcW_4v  Oq  xlcw av  k5o, 

(5.5) 

G(")TRcwir1) 

G(")TRCG(")  ’ 

(5.6) 

( n ) 
W  AV 

=  wiV15  -ft(n)G(n). 

(5.7) 

We  review  briefly  the  basic  properties  of  the  sequence  of  AV  filters  W AV,---,  w AV,---  [7].  Under 
perfectly  known  input  statistics  (matrix  Rc),  the  sequence  converges  to  the  ideal  MVDR  solution  w mvdr  in 
(5.3).  When  the  autocovariance  matrix  Rc  is  estimated  from  a  finite  data  record  through  sample  averaging, 
Rc  =  X)n=i  g(r«;  c)gT(r„;  c),  then  the  early  non-asymptotic  elements  of  the  sequence  of  AV  filter  estimates 
•  offer  favorable  bias/variance  balance  and  outperform  in  mean-square  (MS)  filter  estimation 
error  the  sample-matrix-inversion  (SMI)  MVDR  filter  estimate.1  Therefore,  we  need  to  emphasize  that  the 
sequence  w^2,),  •  •  •  can  be  viewed  not  only  as  a  sequence  of  estimates  of  the  ideal  AV  filters  but,  most 
importantly,  as  a  sequence  of  estimates  of  the  ideal  MVDR  filter.  Data-record-based  criteria  for  the  selection 
of  the  best  AV  estimate  of  the  MVDR  filter  were  recently  reported  in  [8]. 


5.4  Algorithmic  Developments 


In  [2],  [6],  the  cutoff  parameter  vector  c  of  the  front-end  nonlinearity  g(r;  c)  was  obtained  through  supervised 
training  according  to  either  a  mean-square-error  (MSE)  or  a  bit-error-rate  (BER)  optimization  criterion.  In 
this  chapter  we  develop  and  analyze  blind  techniques  for  the  estimation  of  the  cutoff  parameter  vector  of 
the  nonlinear  processor.  2 

We  propose  to  use  as  a  criterion  for  the  selection  of  the  parameter  vector  c  the  Fisher  discriminant 
function  [9]  evaluated  at  the  output  of  the  linear  post-processor  filter.  The  Fisher  discriminant  function 
J-(c,  w)  for  the  SS  signal  model  under  consideration  takes  the  form 


^(c,w) 


2/i2(c,  w) 
*/+n(c,w) 


(5.8) 


where  c  denotes  the  cutoff  parameter  vector,  w  denotes  a  given  linear  post-processor  filter,  p(c,  w)  denotes 
the  conditional  mean  of  the  linear  filter  output  conditioned  on  bo  =  +1,  and  cr2+ii(c,  w)  is  the  corresponding 
output  variance  due  to  MAI  and  noise.3  We  note  that  if  w  is  an  adaptive  filter  such  as  w MVDR  or  wAV,  then 
it  depends  on  the  parameter  c  through  the  input  autocovariance  matrix  Rc.  Maximization  of  the  Fisher 
discriminant  function  with  respect  to  c  aims  at  creating  two  well  separated  clusters  of  filter  outputs,  one 
corresponding  to  6g  =  +l  and  the  other  corresponding  to  6g  =  — 1. 

*w j?y,n  =  1,2,---,  are  obtained  from  (5.4)-(5.7)  with  Rc  in  place  of  Rc.  Similarly,  wsmi-mvdr  =  _^° 

s0  rc  So 

from  (5.3).  For  a  fixed  data  record  lim  =  wsmi-mvdr  [7]. 

n — >oo 

2We  recall  that  the  evaluation  of  wsmj-mvdr  or  w Av  is  inherently  unsupervised  (blind). 

3The  linear  filter  output  conditioned  on  bo  =  —1  has  mean  — p(c,w). 
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Exploiting  the  symmetry  of  the  conditional  pdf’s  of  the  linear  filter  output  given  bo  =  +1  and  bo  =  —  1 
(recall  that  the  Hampel  nonlinearity  is  also  symmetric),  we  can  show  in  a  straightforward  manner  that 


_  2 E2  {60w(Tg(r;  c) } 
Var{b0  w^g(r;c)} 


(5.9) 


where  Var(-)  denotes  variance  and  the  filter  subscript  “c”  serves  as  a  reminder  that  the  filter  is  a  function 
of  c.  The  Fisher  discriminant  function  in  (5.9)  is  of  supervised  nature,  which  is  exactly  what  we  would 
like  to  avoid  in  our  developments.  We  propose  to  design  a  blind  (or  decision  directed)  version  of  (5.9)  by 
substituting  bo  by  sgn  [w(Tg(r;  c)]  (output  of  the  sign  detector  that  follows  the  linear  filter).  Then, 


2E2{sffn[wjTg(r;  c)]w(Tg(r;  c)} 
Var{sgn[ w(Tg(r;  c)]w^g(r;  c)} 

2£2{|w?g(r;  c)|} 

V ar{|w^g(r;  c) | } 


(5.10) 

(5.11) 


where  the  subscript  “B”  identifies  the  blind  version  of  the  Fisher  discriminant  function.  The  following 
proposition  identifies  conditions  under  which  J-s(c,  wc)  is  nearly  equal  to  JF(c,wc). 


Proposition  5.1  To  the  extent  that  the  linear  filter  output  conditional  distidbutions  (conditioned  on  the 
transmitted  information  bit  of  the  user  of  interest)  can  be  approximated  by  Gaussian  pdf ’s,  if  ^  1, 

i.e.  the  filter  output  SINR.  is  significantly  higher  than  OdB,  then  Tb{ c,  wc)  ps  T(c,  wc).  □ 


To  estimate  Tb{ c,  wc)  from  a  data  record  of  finite  size  we  substitute  the  statistical  expectations  in 
(5.11)  by  sample  averages  and  we  denote  the  estimate  by  (Fb{ c,  wc).  The  following  criterion  summarizes  the 
procedure  for  the  selection  of  the  cutoff  parameter  c. 


Criterion  5.1  For  a  given  data  record  of  size  N,  the  Fisher  discriminant  selection  rule  chooses  the  cutoff 
parameter  c  such  that 

c0  =  argmax )Fb(c,  wc)  (5.12) 

C  □ 


Some  critical  observations  now  follow.  We  understand  that  Criterion  5.1  introduces  a  coupled  maxi¬ 
mization  problem  that  can  be  solved  by  utilizing  a  gradient  type  adaptation  for  the  parameter  c.  In  each 
iteration  step  the  adaptation  of  c  is  followed  by  the  evaluation  of  wc  (be  it  w Smi-mvdr,c  or  wAV,c).  We 
also  emphasize  that  the  Fisher  discriminant  function  is  used  for  the  selection  of  the  cutoff  parameter  c  only, 
while  the  linear  filter  is  determined  strictly  according  to  the  MVDR  or  AV  filtering  principles.  The  use  of 
the  Fisher  discriminant  criterion  J~b (■,  ■)  to  select  both  c  and  w  would  return  phase  ambiguous  filters  that 
may  be  locked  on  a  strong  SS  interferer  instead  of  having  a  coherent  lock  on  the  signal  of  interest.  Finally, 
we  note  that  the  Fisher  discriminant  function  for  the  case  of  conditionally  Gaussian  linear  filter  outputs 
coincides  with  the  J-divergence  distance  4  between  the  corresponding  pdf’s  [8]. 

To  reduce  the  computational  complexity  of  the  above  coupled  optimization  approach  due  to  the  adaptive 
evaluation  of  the  linear  post-processor  at  each  iteration  step,  we  examine  the  following  decoupled  optimization 
scheme.  In  the  first  step,  we  select  the  parameter  c  that  maximizes  (Fb( c,  wmf)  (i.e.  the  linear  filter  is  fixed  at 
the  user  of  interest  signature-matched  filter  wMF  =  So).  In  the  second  step,  given  the  value  of  the  parameter 
c  returned  by  the  first  step,  say  Ci,  we  calculate  the  adaptive  filter  wSMJ_MVi>fj,Cl  or  wJV,Cl  based  on  the 
input  autocovariance  matrix  RCl.  These  two  simple  steps  complete  the  decoupled  scheme.  There  are  two 
issues  associated  with  the  above  two-step  decoupled  procedure  that  need  to  be  addressed.  The  first  issue 
involves  the  direction  at  which  both  the  coupled  and  the  decoupled  gradient  method  for  the  evaluation  of  c 
is  moving.  This  issue  is  treated  in  the  following  proposition.  The  second  issue  involves  possible  performance 
loss  due  to  the  decoupled  optimization  and  it  is  examined  through  simulations  in  the  next  section. 

4The  J-divergence  distance  J(/i,  f2)  between  two  distributions  /i(-)  and  f2()  is  defined  as  the  sum  of  the  Kullback- 
Leibler  (K-L)  distances  between  fi  and  f2:  J{fi,  f2)  =  D(f i,  f2)  +  D{f2,  fi)  where  ZJ(/i,  f2)  =  fi{x)  log  j^dx 
is  the  K-L  distance  from  f2()  to  /i(-). 
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Proposition  5.2  To  the  extent  that  the  nonlinear  pre-processor  suppresses  only  the  impulsive  noise  com¬ 
ponent  while  it  passes  undistorted  the  user  of  interest  as  well  as  the  MAI  component,  both  steepest  descent 
recursive  methods  (coupled  and  decoupled)  for  the  determination  of  c  increase  the  value  of  the  Fisher  dis¬ 
criminant  function  at  each  iteration  step. 

Proof:  We  can  show  in  a  straightforward  manner  that  if  Ci  and  C2  are  two  cutoff  parameter  vectors  for 
which  Fb(c-\,w MF)  >  JFb(c 2,wmf),  then 


<D2+„,9(g(r;ci))  <  'Tj+n,9(g(r;  c2))* 

(5.13) 

<72,9(g(Dci))  <  <T2i9(g(r;c2)), 

(5.14) 

•^fl(CllWSMI-MTOJi,Ci)  >  d~B  (C1 SMI-MVDR.C2  ) 

>  d~B  {c2iWSmi-mvdr,c2  ); 

(5.15) 

TB{ Ci,WAV,Cl)  >  Fb(ci,  Wav,o2)  >  Fb{C2,Wav,C?) 

(5.16) 

where  of+ri  and  a)  denote  the  squared  norm  of  a  vector  with  elements  the  variances  of  the  chip-nonlinearly 
processed  samples  of  interference  and  noise  (7  +  n )  or  just  noise  (n),  respectively.  □ 


5.5  Simulation  Results 

We  consider  a  system  of  6  synchronous  SS  signals  with  processing  gain  31.  The  normalized  signature  cross¬ 
correlations  between  the  interfering  signals  and  the  signal  of  interest  are  in  the  range  of  0.22  to  0.35  while 
the  interfering  signal  signatures  are  approximately  orthogonal  to  each  other.  The  power  of  the  interfering 
users  (users  1-5)  is  fixed  at  9,  10,  11,  12  and  13  dB,  respectively.  We  also  assume  the  presence  of  additive 
non-Gaussian  noise  under  the  familiar  e-contamination  model 

fe(x)  =  (1  -  e)f0(x)  +  e/i(z)  (5.17) 

where  fo(x)  denotes  the  ambient  background  Gaussian  noise  pdf.  The  contamination  parameter  e  G  [0,1] 
indicates  the  probability  that  the  chip  noise  samples  are  fi(x)  distributed.  We  assume  that  fo(x)  and  fi(x) 
are  zero-mean  Gaussian  with  variance  <7q  =  1  and  cr)  =  72<7q  (72  =  1000),  respectively,  and  we  set  e  equal  to 

0.2. 

Fig.  5.2  examines  the  BER  performance  as  a  function  of  the  data  record  size  of  the  conventional  MF 
and  SMI-MVDR  receivers  (without  nonlinear  pre-processing)  and  the  Hampel  pre-processor  followed  by 
either  the  SMI-MVDR  or  the  AV- filter  estimator  wAv.  The  Hampel- MF  performance  curve  is  also  included 
for  reference  purposes.  The  power  of  the  signal  of  interest  is  fixed  at  12  dB.  The  solid  lines  indicate  the 
performance  of  the  coupled  procedures  while  the  dotted  lines  show  the  performance  of  the  corresponding 
decoupled  procedures.  Fig.  5.3  reports  the  same  studies  as  in  Fig  as  a  function  of  the  power  of  the  signal  of 
interest.  The  estimation  of  the  cutoff  parameter  c  and  the  estimation  of  the  adaptive  filter  (w SMI-MvnR,c 
or  wAv,c)  is  based  on  the  same  record  of  128  data  samples.  The  results  in  both  Figs.  2  and  3  are  averages 
over  150  independent  Monte-Carlo  runs.  We  observe  that  the  decoupled  procedures  do  not  suffer  substantial 
performance  loss  while  they  maintain  significantly  lower  computational  complexity  when  compared  to  the 
coupled  procedures.  We  also  note  the  performance  gains  that  the  Hampel  AV-filter  structure  exhibits  in  low 
sample  support  situations  when  compared  to  the  Hampel  SMI-MVDR  filter. 
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Fig.  5.2.  BER  as  a  function  of  the  data  record  size. 


Fig.  5.3.  BER  as  a  function  of  the  power  of  the  user  of  interest. 
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Chapter  6 


Performance  Analysis  of  Joint  and 
Disjoint  Space/Time  Receivers  in  Fast 
Frequency- Nonselect  ive  Fading 
DS-CDMA  Channels 

6.1  Introduction 

In  this  chapter  we  analyze  the  performance  of  DS-CDMA  single-user  antenna-array  receivers  that  utilize 
either  a  cascade  of  spatial  and  temporal  linear  filters  or  one  joint  space-time  linear  filter.  Receiver  filters 
are  estimated  from  limited  sample  support  due  to  a  fast  frequency-nonselective  fading  channel  considered. 
Linear  filters  are  preferred  due  to  their  conceptual  simplicity  and  low  computational  cost.  Among  them, 
the  conventional  matched  filter  (MF)  receiver  has  the  lowest  complexity  but  exhibits  poor  performance  in 
strong  interference  environments  or  when  the  user  of  interest  is  highly  correlated  with  the  interferers.  On 
the  other  hand,  the  linear  minimum-mean-square-error  (MMSE)  and  the  minimum-variance-distortionless- 
response  (MVDR)  receivers  strike  a  balance  between  complexity  and  performance  [1].  We  consider  both 
MF-type  and  MMSE/MVDR-type  filters,  and  we  use  pre-detection  signal-to-interference-plus-noise-ratio 
(SINR)  as  the  figure  of  merit.  To  simplify  the  analysis,  we  confine  ourselves  within  the  class  of  filters  that 
are  distortionless  in  the  signal  direction  of  the  user  of  interest.  Our  goal  is  twofold:  (i)  Quantify  the  receiver 
output  variance  and  receiver  output  SINR  performance  of  disjoint  and  joint  space-time  configurations  given 
ideal  input  statistics;  (ii)  quantify  the  confidence  level  in  a  certain  neighborhood  of  the  optimal  performance 
point  of  the  corresponding  estimated  structures  that  are  evaluated  based  on  limited  sample  support. 

6.2  Signal  Model 

We  consider  a  synchronous  DS-CDMA  communication  system.  Let  L  and  K  denote  the  system  processing 
gain  and  the  number  of  active  users  in  the  system,  respectively.  We  assume  that  each  user  is  assigned 
a  normalized  signature  vector  s*,,  k  =  1,  2,  •  •  •  K,  where  Sf.[l]  =  {±1  /\fL},  l  =  1,2,  •••  ,L.  The  receiver 
employs  a  uniform  linear  antenna  array  with  M  antenna  elements,  spaced  at  half-the-wavelength  apart.  If 
the  angle-of-arrival  (AoA)  of  user- A:  is  denoted  by  6k,  then  the  array  response  vector  (spatial  signature)  of 
user-A:  is  defined  as  a*.  =  [l  e~i7rslnek  ...  e_:'*M_1^7rsln6,,!]T. 

Without  loss  of  generality,  we  assume  chip  synchronization  with  the  user  of  interest  ( user-1 )  at  the 
reference  antenna  element  {in  =  1).  After  conventional  chip  matched  filtering  and  chip  rate  sampling,  we 
can  visualize  the  space-time  data  samples  in  the  form  of  a  complex  matrix  X  £  CMxL 

XMx*(i)  =  [xmx  i(iff  1)  xmx  i(lAt  2)  ...  xMx1(zL  +  L)\  (6.1) 

where  the  column  vector  xMxl(iL  +  l)  associated  with  the  Zth  chip  sample  within  the  ith  information  bit 
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period  is 


K 

xMxi(iL  +  l)  =  bk (i) ^EkCk (Qafe  +  nMxl(iL  +  /),  l  =  1,.  ..,L.  (6.2) 

fc=l 

In  (6.2),  Ef,,  bk(i)  and  c;.(?)  denote  the  energy,  the  ith  binary  information  bit  and  the  Zth  signature  chip  of 
user-A:,  respectively,  n Mxl(iL  +  l )  ~  A/”(0Mxl,  <t2Im)  is  an  AWGN  vector  (IM  denotes  the  M  x  M  identity 
matrix) . 

Let  Sk  =  Sfc  ®  &k  be  the  joint  space-time  signature  of  user-A:,  where  Sk  6  CMi  and  ®  denotes  the 
Kronecker  product.  Let  also 

A-mxK  =  [al  a2  •  •  •  aA']  ;  Stxif  =  [S1  S2  •  •  •  Sa]  ; 

SjtLxtr  =  [§i  ®2  •  •  •  §a]  ;  Ejs-xjs-  =  diag  ( E\ .  E^ ,  •  •  • ,  Ex) , 


B(i)jfXif  =  diag  (bi(i),  b2(i ),  ...,bK(i)), 

N(i)Mx£,  ==  [n(iL  +  1)  ...  n (iL  +  L)\, 

b(i)jfXi  =  [bi(i),  b2(i),-..,  bK{i)]T  ■ 

(6.3) 

Then,  (6.1)  can  be  written  compactly  as 

X(i)  =  AE1/2B(i)ST  +  N(i). 

(6.4) 

We  note  that  bold  upper-case  letters  denote  matrices  while  bold  lower-case  letters  denote  vectors.  Super¬ 
scripts  T,  H  and  *  denote  transpose,  Hermitian  and  conjugation  operation,  respectively. 

In  the  following  section,  we  summarize  the  receiver  configurations  that  we  consider  in  this  chapter  for 
one-shot  detection  of  the  information  bit  of  the  user  of  of  interest  (the  index  i  in  (6.4)  is  dropped  for  simplicity 
in  notation). 


6.3  Joint  and  Disjoint  Space-Time  Filter  Configurations 

While  in  disjoint  domain  processing  we  operate  on  the  data  matrix  X  directly,  to  facilitate  joint  domain 
processing,  we  vectorize  the  matrix  XMx£,  by  stacking  all  columns  to  form  a  vector  as  follows: 

=  Vec{XMx4  =  SE^b  +  n,  (6.5) 


where  n  =  Vec{NMxI,}. 

Let  Wj  G  CMi,  ws  E  CM,  and  wt  e  CL  denote  an  arbitrary  joint  space-time,  spatial  and  temporal  linear 
filter,  respectively.  Information  bit  decisions  are  made  by  taking  the  sign  of  the  real  part  of  the  final  output 
of  the  corresponding  receiver  configuration,  i.e.  information  bits  are  estimated  by  &i  =  sgn  [iie{w  ?*}]  in 
joint  domain  filtering,  or  by  b\  =  sgn  [-Rejw^Xwt}]  in  disjoint  domain  filtering.  We  also  assume  that  the 
filters  Wj,  ws  and  wt  are  distortionless  in  the  joint  space-time,  spatial  and  temporal  direction  of  the  user 
of  interest,  respectively,  i.e.  w" Si  =  1,  wfai  =  1,  and  wf  Si  =  1.  Let  r(fl)  and  -P(fl)  denote  output  SINR 
and  output  variance,  respectively,  of  an  arbitrarily  chosen  configuration  Q  .  Since  r(fl)  =  E1/  [-P(fl)  —  E j], 
then  lower  receiver  output  variance  implies  higher  receiver  output  SINR. 

The  joint  space-time  MF  wJMF  and  the  joint  MVDR  filter  wJMV  [2]  for  user-1  are  given  by 


w  jMF  =  Si/M, 


W  JMV 


R  *  Si 

sfR-1^’ 


(6.6) 


where  R  =  E{XXH}  =  SESff  +  u2I  is  the  auto-covariance  matrix  of  the  space-time  received  vector, 
E{-}  denotes  statistical  expectation  operation.  The  output  variance  of  the  joint  matched  filter  is  PJMF  = 
s^Rsi/M2,  while  the  output  variance  of  the  joint  MVDR  filter  is  PJMv  =  (§^R-1§i) 

On  the  other  hand,  disjoint  domain  processing  can  take  the  form  of  spatial  filtering  followed  by  temporal 
filtering  or  temporal  filtering  followed  by  spatial  filtering.  Clearly  spatial  MF  combined  with  temporal 
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MF  in  any  order  is  equivalent  to  joint  MF  (JMF).  Below  we  list  all  other  possible  configurations  (in  the 
processing  order) ,  and  evaluate  the  corresponding  filters  as  well  as  the  output  variance  of  each  configuration, 
as  a  function  of  M,  L,  E,  <r2,  A  and  S  (in  each  configuration  the  left  operation  is  the  type  of  processing 
performed  first).  In  the  following,  the  letter  s  and  t  when  used  as  part  of  the  subscript  of  a  vector  or  matrix 
indicates  spatial  and  temporal  filtering,  respectively,  while  MF  and  MV  refers  to  MF  and  MVDR  type 
processing.  The  symbol  |  is  used  as  a  “condition”  indicator  (e.g.  tMV\sMF  implies  that  temporal  MVDR 
processing  follows  spatial  MF  processing). 

(i)  Spatial  MF  -  temporal  MVDR 

ws  =  weMF  =  &i/M, 

(6.7) 

W(MV|sMF  ^t2|sAfF®l/  (®1  ^*2|sMF®l)  5 

(6.8) 

PtMV\sMF  =  [S1  R-(2|«MF-Sl]  ) 

(6.9) 

where  Rt2|.MF  =  E  {(w^X)H(w^X)} 

—  S*E1'/2diag  (  A  ^,A)e^+> 

(ii)  Spatial  MVDR  -  temporal  MVDR 

ws  =  waMV  =  R-?  a,  /  (af  R^a^  , 

(6.10) 

WtMV\sMV  (®1  ^t2|sMV®l)? 

(6.11) 

PtMV\eMV  =  [S1  I^t2|«MVSl]  > 

(6.12) 

where  Rsi  =  E  {xxff}  =  yAEAff  +  cr2I, 

1 1 

R.f 2 1 smv  A  E  {(wfMVX)ff(wfMVX)} 

=  S*E1/2diag  (AHweMVw^MVA)  E1/2St 
+  ||woMV-||2o-2I. 

(iii)  Spatial  MVDR  -  temporal  MF 

ws  =  weMV,  wf  =  WfMJ?  =  Sl, 

PtMF\sMV  ~  Rt2|sMV®l* 

(6.13) 

(iv)  Temporal  MF  -  spatial  MVDR 

w*  =  Si, 

WsMl|fMF  ■^s2|tMF'^l/  (*R  ^^s2  1 1 M  )  5 

(6.14) 

PaMV\tMF  [*R  -^s2|tMJF^-l]  5 

(6.15) 

where  R«2,tMF  =  E  {(XwtMF)(Xw1MF)fl } 

=  AE^diag  (STslSf  S*)  E1/2Aff  +  <r2I. 

(6.16) 

(v)  Temporal  MVDR  -  spatial  MVDR 

w t  =  wtMv  =  R-f“ilsi/  (sf R-alsi) > 

(6.17) 

W sMV\tMV  (*R 

(6.18) 

PsMV\tMV  [<R  I^s2  ? 

(6.19) 

where  Ru  =  E  {XffX}  /M  =  S*EST  +  <r2I, 

R.s2  \tMV  =  E  {(Xw,m)(Xw,Mf)ff} 
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=  AEi/2diag  (STwtMVw?MVS*)  Ei/2AH 
+  ||wfMV||2o-2I. 


(vi)  Temporal  MVDR  -  spatial  MF 


w  t  =  wtMV,  ws  =  ai/M, 

PsMF\tMV  =  ai  R-«2|tMV'al/^  •  (6.20) 

Let  us  define  the  relative  performance  measure  ^ti(fl)  =  P(JMV)  /  P(fl),  where  fl  denotes  an  arbitrary 
receiver  configuration  and  JMV  denotes  the  joint  space-time  MVDR  receiver  configuration.  Then,  0  < 
/ii(f 2)  <  ni(JMV)  =  1,  while  high  ^ti(D)  indicates  low  output  variance  P(Ll)  (or,  equivalently,  high  output 
SINR).  In  the  following  we  will  use  ^ti(fl)  to  rank  the  above  mentioned  configurations  in  terms  of  total 
output  SINR. 


6.4  Receiver  Performance  under  Limited  Sample  Support 

In  practice,  the  input  statistics  R  as  well  as  Rs  and  R*  are  not  known,  but  estimated  from  received  data 
vectors,  e.g.  R  =  y?  )Cj;=i  AjAV,  where  N  denotes  the  size  of  the  data  record.  The  estimated  MVDR 
filter  that  utilizes  the  inverse  of  a  sample- average  estimate  of  the  autocovariance  matrix  is  known  as  the 
sample- matrix- inversion  (SMI)  filter.  In  the  following,  w  and  P(-)  refer  to  an  estimate  of  w  and  the  output 
variance  of  a  configuration  that  utilizes  filter  estimates,  respectively. 

Let  jtt2(D)  =  P(fl)/ P(Q),  and  /i(fl)  =  Mi(^)tt2(^)-  Then,  /.t(fl)  =  P(JMV)/ P(Q)  characterizes  the 
output  variance  of  the  “estimated”  configuration  Q  relative  to  the  ideal  JMV  configuration.  Evidently, 
higher  /i(fl)  implies  higher  output  SINR  exhibited  by  the  estimated  configuration. 

In  [3]  it  was  shown  that  under  the  assumption  that  the  received  data  vectors  are  independent  and 
identically  distributed  according  to  a  multivariate  complex  Gaussian  distribution,  then,  for  any  given  £  e 
(0, 1)  and  N  >  ML, 

Pr[^{JMV)  >  1  -  C]  *  1  -  Q  (Ay^L^2)  >  <6-21) 

where  Q(x)  =  J—  J)°°  e~ *  1 2dt  is  the  complementary  Gaussian  cumulative  distribution  function. 

In  the  following  theorem,  we  extend  the  result  in  (6.21)  for  all  disjoint  domain  receiver  configurations 
presented  in  the  previous  section. 


Theorem  6.1  For  any  given  (  G  (0, 1)  and  N  >  max(M,  L), 


Pr  M tMV\sMF )  >  1  -  C]  *  1  -  Q 
Pr  MtMFl,MV)  >  1  —  C]  » 1  —  Q  ( + 

Pr  [M2( sMV\tMF )  >  1  -  C]  »  1  -  Q 

Pr  W,M  F\tMV)  >  1  -  C]  M 1  -  q  (  ) 


Pr  \jjL2{tMV\sMV)  >  1  —  C] 


1  -Q 
1  -Q 


(  LN(-M  +  2\ 

[y/LNCil-  T)J_ 

(  NC-L  +  2' 

WNai-o/ 


(6.22) 

(6.23) 

(6.24) 

(6.25) 


(6.26) 
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Pr[p2(sMV\tMV)  >  1  —  C] 


(  MN(  -  L  +  2  \ 

VV^c(i-c)7 


1  -Q 


N(-  M  +  2\ 

-o ) 


(6.27) 

□ 


Using  Theorem  6.1  we  can  evaluate  the  data  record  size  that  is  necessary  to  achieve  a  certain  confidence 
level  e  by  the  estimated  configuration  in  a  neighborhood  £  of  the  ideal  configuration,  which  can  be  expressed 
by  Pr  \ji 2(fl)  >  1  — C]  >  e  f°r  any  given  £  6  (0, 1)  and  e  G  (0, 1).  The  results  are  summarized  in  Theorem  6.2 
below  where  the  last  statement  (6.33)  is  due  to  [3]  (the  proof  is  omitted).  For  convenience  in  presentation 
we  define  the  following  operation: 

— ,  if  e  <  0.5 
if  e  >  0.5. 


©  = 


(6.28) 


Theorem  6.2  For  any  given  £  G  (0, 1)  and  e  G  (0, 1),  the  least  number  of  samples  that  guarantees  Pr  [fi2  >  1  —  £]  > 
e  can  be  approximated  by 


NtMV\sMF  —  TYICLX  ^  -Zv, 
N tMF\sMV  —  TRCLX 


M 

T' 


N sMV\tMF  —  TYICLX  ^  M , 

f  L 

N sMF\tMV  —  maX  \ 

NJMV  =  max  [ml, 


(1  -  £)V>2  +  2 (L  -  2)  0  yg(T  -  £)[4L  -  8  +  (1  -  (W] 

2£ 

'(1  -  £)V>2  +  2 (M  -  2)  ©  y/^{l  -  £)[4M  -  8  +  (1  -  £)^2]' 

->/•£ 

(1  -  +  2 (M  -  2)  ©  x/V'2(l-C)[4M-8+(l-£)V'2] 

2£ 

'(1  -  C)V>2  +  2 (L  -  2)  ©  vg(T  -  £)[4L  -  8  +  (1  -  £)^2] 

2  M( 

(1  -  OV'2  +  2 (ML  -  2)  ©  VV’2(l-£)[4ML-8+(l-£)V’2] 

2£ 


(6.29) 

(6.30) 

(6.31) 

(6.32) 

(6.33) 


where  t[>  =  Q  X(1  —  e).  [a:]  denotes  the  smallest  integer  larger  than  or  equal  to  x. 
We  note  that 

1-C 


Pr  ln{il)  >!-(}  =  Pr 


l-i2(fl)  > 


=  Pr 


qi2(fl)  >  1  — 


/ii(fl)  +  £  —  1 

jtti(U) 


(6.34) 


Using  Theorem  6.2  we  can  evaluate  Nf  such  that  Pr  [/i(U)  >  1  —  £]  >  e,  i.e.  evaluate  the  data  record  size 
N'q  that  is  necessary  to  achieve  a  certain  performance  confidence  level  e  by  the  estimated  configuration 
in  the  neighborhood  £  of  the  optimum  (JMV)  configuration.  This  can  be  accomplished  by  substituting 
£'(U)  =  1  -  into  eqs.  (6.29)-(6.32). 


6.5  Simulation  Results 


We  consider  a  synchronous  DS-CDMA  system  with  K  =  9  users  and  processing  gain  L  =  15.  The  temporal 
signature  cross-correlations  are  in  the  range  of  [0.07,  0.33].  We  consider  user-1  as  the  user  of  interest.  We 
assume  that  the  AoAs  of  all  users  are  equally  spaced  in  [— f ,  f]  and  6\  =  0.  We  consider  two  scenarios  with 
respect  to  different  power  profile. 
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Scenario  (i):  Perfect  power  control 

All  users  have  the  same  input  SNR  fixed  at  13dB,  including  the  antenna  array  gain  evaluated  as  lOlog(M). 
The  results  are  tabulated  in  Table  6.1  and  Table  6.2. 

Table  6.1  indicates  that  for  a  given  L,  if  M  is  much  smaller  than  L,  then  the  tMV|sMF  configuration 
outperforms  all  other  disjoint  configurations  with  high  probability.  On  the  other  hand,  when  M  is  much 
larger  than  L  then  the  sMV|tMF  configuration  performs  the  best  among  all  disjoint  configurations  with  high 
probability.  Similar  performance  characteristics  were  also  observed  in  [5]. 


Table  6.1.  Output  SINR  performance  under  ideal  statistics 


configuration 

M  =  2 

M  =  5 

M  =  10 

M  =  18 

JMV 

15.10 

19.65 

19.82 

19.94 

tMV  sMF 

10.12 

15.85 

17.81 

19.74 

tMF  sMV 

3.82 

14.67 

18.81 

19.86 

tMvjsMV 

10.12 

15.58 

18.84 

19.86 

sMV  tMF 

3.82 

15.80 

18.91 

19.88 

sMF  tMV 

6.78 

13.28 

16.24 

18.56 

sMvjtMV 

6.79 

13.34 

16.70 

18.62 

JMF 

3.82 

15.21 

17.50 

19.74 

Table  6.2  provides  the  minimum  number  of  data  samples  needed  by  each  configuration  to  achieve,  with 
confidence  level  e,  receiver  output  SINR  in  a  neighborhood  f  of  the  optimum  JMV  performance,  for  different 
values  of  e  and  (.  An  empty  entry  indicates  that  the  corresponding  configuration  can  never  reach,  with 
confidence  e,  an  SINR  performance  level  that  lies  within  distance  f  from  the  JMV  performance.  We  observe 
that  when  a  disjoint  configuration  can  achieve  output  SINR  with  confidence  level  e  in  a  neighborhood  (  of 
the  JMV  performance,  it  requires  much  less  data  support  than  the  JMV  configuration  does. 


Table  6.2.  Minimum  number  of  data  samples 


N1 

M  =  10 

M  =  18 

C=0.3 

e=0.9 

C=o.i 

e=0.9 

C=0.3 

e=0.9 

C=o.i 

e=0.9 

JMV 

539 

1636 

954 

2887 

tMV  sMF 

81 

— 

60 

201 

tMF  sMV 

3 

16 

5 

15 

sMV  tMF 

45 

219 

71 

224 

sMF  tMV 

9 

— 

5 

37 
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Scenario  (ii):  Unequal  power  users 

We  use  the  same  setup  as  in  Scenario  (i)  except  that  now  the  SNR  of  users  6-9  is  fixed  at  23dB  (while  the 
SNR  of  users  1-5  is  kept  at  13dB  as  before).  In  other  words,  users  6-9  have  a  lOdB  power  advantage  relative 
to  the  user  of  interest.  The  simulation  results  are  tabulated  in  Table  6.3  and  Table  6.4. 


Table  6.3.  Output  SINR  performance  under  ideal  statistics 


configuration 

M  =  2 

M  =  5 

M  =  10 

M  =  18 

JMV 

14.99 

19.64 

19.82 

19.94 

tMV  sMF 

8.91 

13.06 

15.31 

18.94 

tMF  sMV 

1.20 

9.43 

18.51 

19.83 

tMvjsMV 

6.32 

11.58 

18.53 

19.83 

sMV  tMF 

1.25 

9.98 

18.59 

19.85 

sMF  tMV 

5.68 

10.40 

12.93 

15.20 

sMVtMV 

5.83 

10.57 

13.39 

15.26 

JMF 

0.87 

8.00 

11.03 

18.81 

Table  6.4.  Minimum  number  of  data  samples 


N' 

M  =  10 

M  =  18 

C=0.3 

e=0.9 

C=o.i 

e=0.9 

C=0.3 

e=0.9 

C=o.i 

e=0.9 

JMV 

539 

1636 

954 

2887 

tMV  sMF 

194 

— 

67 

349 

tMF  sMV 

4 

23 

5 

16 

sMV  tMF 

47 

308 

71 

227 

sMF  tMV 

— 

— 

15 

— 
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Chapter  7 


Near-ML  Multiuser  Detection  with 
Linear  Filters  and  Reliability-based 
Processing 

7.1  Introduction 

In  direct-sequence  code-division-multiple-access  (DS-CDMA)  systems,  multiple  users  transmit  information 
bit  sequences  over  a  common  channel  utilizing  distinct  individually  assigned  signature  waveforms.  In  the 
presence  of  strong  interferers  and/or  high  cross-correlation  signature  waveforms,  the  performance  of  the 
conventional  single-user  signature  matched-filter  detector  degrades  significantly.  On  the  other  hand,  if  the 
user  signatures,  energies,  and  the  channel  noise  power  are  known  at  the  receiver  end,  use  of  the  maximum- 
likelihood  (ML)  multiuser  detector  [1]  guarantees  minimum  probability  of  error  system  performance.  ML 
optimum  multiuser  detection,  however,  comes  with  computational  complexity  that  grows  exponentially  with 
the  number  of  users. 

In  search  of  a  satisfactory  trade-off  between  performance  and  complexity,  numerous  sub-optimum  mul¬ 
tiuser  detectors  have  been  proposed.  Two  linear  multiuser  detectors  that  are  of  particular  interest  to  our  work 
are  the  familiar  zero-forcing  decorrelating  detector  [2]  and  the  minimum-mean-square-error  (MMSE)  detec- 
tor  [3]- [6] .  Both  are  known  to  be  near-far  resistant  [7].  The  decorrelator  is  the  maximum  likelihood  estimator 
of  the  energy  modulated  user  bits  for  additive  white  Gaussian  noise  (AWGN)  multiple- access  channels.  The 
MMSE  receiver  is  the  linear  filter  solution  that  maximizes  the  pre-detection  signal-to-interference-plus-noise 
ratio  (SINR)  of  each  user. 

Recently,  there  has  been  an  effort  to  directly  approximate  the  ML  multiuser  detector  in  an  efficient  and 
effective  manner.  Examples  of  such  work  are  the  “greedy”  detector  in  [8]  which  partitions  the  likelihood 
metric  function  and  proceeds  with  “part-by-part”  maximization  and  the  neighbor  searching  mechanisms  in 
[9,  10].  The  “MK-face”  detector  in  [9]  searches  within  a  pre-calculated  set  of  neighboring  candidates.  The 
gradient  search  algorithm  in  [10]  happens  to  coincide  with  the  “K-face”  detector  of  [9]. 

As  a  combinatorial  optimization  problem,  ML  multiuser  detection  has  certain  similarities  with  ML 
decoding  of  binary  linear  block  codes.  One  of  the  earliest,  well  known  low-complexity  algorithms  for  sub-ML 
block-code  decoding  is  Chase’s  algorithm  [11].  The  Chase  procedure  selects  a  small  fixed  number  of  bit 
decision  perturbations  based  on  reliability  measurements  which  are  supposed  to  be  part  of  the  demodulator 
output  (usually  directly  related  to  the  sufficient  decision  statistic).  In  this  chapter,  we  follow  an  analogous 
approach  to  tackle  the  ML  multiuser  detection  problem.  First,  we  use  a  linear  multiuser  detector  (such 
as  the  decorrelator  or  MMSE  detector)  to  obtain  an  initial  multiuser  bit  decision.  The  soft  output  vector, 
before  being  discarded,  is  processed  to  derive  the  corresponding  reliability  measurement  vector.  Next,  as 
dictated  by  the  reliability  vector,  a  sequence  of  “error  patterns”  is  generated  to  “correct”  the  initial  decision 
vector  and  produce  multiuser  bit  combination  candidates  that  include  the  maximum  likelihood  solution  with 
high  probability.  Final  decision  is  made  by  simply  choosing  the  bit  combination  with  the  highest  likelihood 
among  all  candidates.  When  the  length  of  the  error  pattern  sequence  is  in  the  order  of  the  number  of  users, 
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the  proposed  algorithm  exhibits  near-ML  performance  at  remarkably  reduced  computational  complexity 
compared  to  the  ML-optimum  multiuser  detector.1 

We  organize  the  rest  of  the  chapter  as  follows.  In  Section  7.2,  we  present  our  signal  model  and  notation. 
The  near-ML  multiuser  detection  algorithm  is  developed  in  Section  7.3.  Simulation  studies  are  presented  in 
Section  7.4  and  a  few  final  conclusions  are  drawn  in  Section  7.5. 


7.2  Signal  Model  and  Notation 


For  the  sake  of  clarity  in  presentation,  we  consider  a  synchronous  DS/CDMA  system  where  K  users  share 
a  multiple- access  AWGN  channel  with  power  spectral  density  <r2.  Our  developments  can  be  generalized  to 
cover  near-ML  block  multiuser  detection  for  asynchronous  systems  by  creating  error  pattern  sequences  of 
length  proportional  to  the  total  number  of  bits  in  a  block.  The  general  block  diagram  representation  of  the 
receiver  that  we  develop  in  this  work  is  given  in  Fig.  7.1.  The  received  signal  is  processed  by  a  front  end 
that  consists  of  a  block  of  K  signature-matched  filters.  We  denote  the  fth  bit  period  output  of  the  matched 
filter  bank  by  y (f)  =  [ yi(i ),  y2(f),  ■  •  •  ,  (T  denotes  the  transpose  operation)  and  we  write 


y (f)  =  RE(i)b(i)  +  n(f),  i  =  1,  2, 


(7.1) 


where  b(f)  =  [61(f),  62(f),  ■  ■  ■  ,  6iy(f)]T  £  {±1}A  is  the  fth  period  information  bit  vector,  E(f)  =  diag  ^  \J E±  (f), 


V (f),  •  •  •  ,  \J EK  (f ) 


is  the  K  x  K  user  signal  amplitude  matrix,  R  is  the  K  x  K  signature  cross-correlation 


matrix  and  n(f)  =  [ni(f),n2(*),  ■  ■  ■  ,nx( f)]  is  the  filtered  noise  vector.  The  ML  optimum  multiuser  detector 
[1]  is 


t>ML  =  arg  max 
befit}* 


(^(b,y) 


=  2b1  Ey  -  b 1  EREb 


(7.2) 


where  for  notational  simplicity  we  dropped  the  index  f  with  the  understanding  that  the  decision  rule  in  (7.2) 
refers  to  a  given  information  bit  interval. 


7.3  Near-ML  Multiuser  Detection 

We  switch  now  our  attention  to  the  linear  operator  L  and  the  “Search  &  Decision”  blocks  in  Fig.  7.1.  The 
linear  operator  can  be  viewed  as  a  multiuser  detector  that  operates  on  the  output  of  the  matched-filter  bank 
y  to  produce  an  initial  decision  t>(°)  for  the  transmitted  bit  vector  b.  Alongside  the  initial  decision,  we 
also  extract  a  “reliability”  measurement  vector  a.  During  the  final  “Search  &  Decision”  stage,  an  ordered 
sequence  of  D  distinct  error  patterns  e(1\  e(2),  . . . ,  e e  {0, 1}A  is  generated  from  the  reliability  vector 
a  where  the  positive  integer  D  is  a  system  parameter  to  be  adjusted  according  to  our  performance  versus 
complexity  requirements.  Corresponding  to  each  error  pattern  e(d\  d  =  1,  2,  ...,  D,  a  new  tentative 
decision  Wd'1  is  produced.  The  final  decision  output  becomes 

bNear-ML  =  arg  max  D(b,y).  (7.3) 

bGfbU),  b<2),  ....  b(-°)} 

As  we  will  see  later  in  Section  7.4,  when  L  is  the  decorrelating  or  MMSE  operator,  a  small  D  (for  instance  of 
the  same  order  as  the  number  of  users/bits  to  be  detected)  results  in  bit-error-rate  system  performance  that 
is  nearly  indistinguishable  from  the  maximum  likelihood  detector  in  operational  environments  of  practical 
interest. 

7.3.1  Initial  Decision  and  Reliability  Measurement 

We  use  a  linear  operator  (multiuser  detector)  L kxk  to  obtain  the  initial  bit  decision  vector  t>(°)  =  sgn(Ly). 
To  exploit  the  reliability  information  contained  in  the  linear  operator/filter  output,  we  define  for  each  initial 

1Another  coding  theory  inspired  multiuser  detector  was  presented  recently  in  [12].  In  contrast  to  the  ML  approxi¬ 
mations  in  [8]-[10]  and  the  work  herein,  the  multiuser  detector  of  [12]  is  a  complete  exponential-complexity  depth-first 
tree  search  implementation  of  the  full  ML  decision  rule  based  on  the  lattice  decoder  developed  in  [13]. 
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bit  decision  b[°^  the  corresponding  soft  output 


=  y,  j  =  l,  2,  K, 


(7.4) 


where  Z)  is  the  j th  row  of  the  matrix  L.  For  convenience,  we  also  define  wj  =  Z^RE  and  substitute  (7.1) 


(7.4)  to  calculate 


dj  =  wlfb +11-  n  =  Wnnbj 


K 

E 

*=1,*7 


Wjih 


n, 


j  =  1,2,...,  K. 


(7.5) 


The  first  term  on  the  right-hand  side  of  (7.5)  is  the  scaled  j  th  transmitted  bit  of  interest.  The  last  term, 
Zjn  =  rij,  is  zero-mean  filtered  Gaussian  noise  with  variance  E{n2}  =  a2ljlllj  =  a2  . .  The  middle  term 
ijtj  Wjibi  =  Zj  contains  the  information  bits  transmitted  by  the  other  users  (multiple-access-interference 
or  MAI)  and  is  a  zero-mean  random  variable  with  variance  E{z 2}  =  JOiLi  w%  =  l!wil|2  —  wjj  =  -  We 

/(aJ|bJ=S(0)) 


define  the  reliability  value  ctj  of  the  initial  bit  decision  Ir^  as  the  log-likelihood  ratio  ay  =  |  In  — - — - 

f  laj\bj=—bj  j 

where  f  (dj  \bj)  denotes  the  probability  density  function  of  dj  conditioned  on  bj.  If  the  filtered  MAI  variance 
E{z2}  is  relatively  small,  the  soft  output  dj  is  well  approximated  [14,  15]  by  a  Gaussian  random  variable 
with  mean  E  {dj}  =  Wjjbj  and  variance  E  { dj 2}  =  a2.  +  a2  . .  Then,  the  reliability  value  ay  simplifies  to  a 
scaled  version  of  dy  | : 


A  x 


A  1. 

a  .;  =  -  In 
3  2 


f 


■  =b{0) 

J  uj 


djWjjb) 


(0) 


w.j 


f  (El6; 


,  =-b{0) 

■j  °3 


(7.6) 


The  generated  reliability  vector  a  =  [ai,---  ,  ax]  is  passed  to  the  “Search  &  Decision”  stage  along 
with  the  hard- limited  initial  “guess”  b(0). 


7.3.2  Generation  of  the  Error  Pattern  Sequence 

When  L  is  the  decorrelating  or  MMSE  filter  operator,  we  expect  intuitively  that  a  small  perturbation  of  the 
initial  vector  b(0)  may  produce  the  maximum  likelihood  decision  bML  •  With  this  motivation,  we  generate  a 
sequence  of  error  patterns  £  {0, 1}K ,  d  =  1,  2,  . . . ,  where  e ^  =  1  stands  for  an  error  in  bit  position 
j.  We  apply  the  error  sequence  e(d)  on  the  initial  bit  vector  b(0)  to  create  the  bit  vector  sequence  b(d) 

as 

follows: 

bf  =  j  =  1,  2,  . . . ,  I<,  (7.7) 

where  ©  denotes  the  error  correction  operation  b  ©  e  =  b  ■  (— l)e,  b  £  {±1},  e  £  {0, 1}.  Naturally,  the  error 
sequence  generation  criterion  will  be  the  likelihood  of  b  =  b(0)  ©  e  with  observation  the  soft-output  vector 
at  =  Ly,  /  ^<S|b  =  b^j.  If,  approximately,  we  treat  the  soft  outputs  dj  as  conditionally  independent  random 

variables,  the  likelihood  function  becomes  IlyLi  /  (^j\bj  =  bj)  and  the  log-likelihood  ratio  is 


K 


A(b)--$> 


3  =  1 


/  (El6i  =  k 

f  (« j\bj  =  ~bj) 


K 


—  E  ai  E  ai  ~  E  ai  ^  E 

j:bj=bi°'>  j-.hjfth™  t=l  j-ej  =  1 


a,-. 


(7.8) 


We  immediately  observe  that  the  first  term  in  (7.8),  XyLi  aji  's  independent  of  the  choice  of  the  error 
pattern  e.  Therefore,  the  error  pattern  sequence  should  be  designed  in  ascending  order  of  the  other  term 

(f)(e,at)  =  ^2j.e  i  otj.  For  our  purposes  of  near-ML  multiuser  detection,  if  all  2K  error  patterns  {e(d)}  ,=1 
are  in  ascending  order  under  the  </>(■,  at)  key 

0(e(d),a)  <0(e<d+1),a),  d=  1,  2,  . . . ,  2^-1,  (7.9) 
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we  will  consider  and  process  only  the  first  D  elements  of  the  sequence. 

We  now  derive  an  algorithm  that  produces  the  first  D  <f>(-,  -)-ordered  error  patterns  with  computational 
complexity  linear  in  the  number  of  bits,  O(DK).  The  origins  of  our  algorithm  are  in  the  “subset-sum 
problem”  where  the  objective  is  to  find  all  subsets  of  a  set  of  numbers  that  have  sum  of  elements  less  than 
or  equal  to  a  given  number  [16,  17].  Suppose  that  the  reliability  values  ay,  j  =  1,  2,  ...,  K,  are  such 

that  ahx  <  ah2  <  •••  <  a.hK-  Define  the  error  pattern  sequences  £p  =  {e(pt)},  p  =  0,  1,  2,  ...,  K, 

t  =  1,  2,  . . . ,  2P,  where  {e(pt)}  is  ordered  under  the  key  (/>(•,  a) 

,  t  =  1,  2,  ... ,  2P  —  1,  (7.10) 

and  satisfies 

=  0  for  every  h  G  {hp+ 1,  hp+  2,  •  •  •  ,  hx}  and  1  <  t  <  2P.  (7.11) 

Here,  is  the  hth  bit  in  the  error  pattern  e(’>  and  (7.11)  states  that  may  include  errors  only  in  the 

p  lowest  reliability  positions  of  the  initial  bit  decision  vector  b(0).  Define  also  the  error  pattern  sequences 
£'p  =  },  p  =  0,  1,  2,  . . . ,  K  —  1,  t  =  1,  2,  . . . ,  2P,  where  {e'^  }  is  such  that 


(pt) 

h 


J  1,  if  h  =  hp+ 1 
\  otherwise. 


(7.12) 


Therefore,  each  e'^Pt^  contains  exactly  one  more  error  in  bit  position  hp+ 1  than  the  corresponding  error 
pattern  e^Pt\  It  follows  that  0  a)  =  0  (e(Pt\a)  +  cv/lp+1  and  {e^pd  )  js  aiso  ordered  under  the  key 

</>(■, a). 

At  this  point,  it  is  important  to  observe  the  following.  If  we  merge  the  two  ordered  sequences  £p  =  {e(')} 
and  £p  =  {e,(')}  in  increasing  order  of  the  same  key  then  we  will  obtain  £p+\.  Therefore,  iterative 

generation  of  £'p  from  £p  and  merging  to  form  £p+i  for  p  =  0,  1,  . . . ,  K  —  1  leads  to  £k  where  £k  is  the 

whole  set  of  2K  error  patterns  ordered  under  the  key  </>(■,  a).  Since  only  the  first  D  error  patterns  in  £k 

are  needed,  we  can  simplify  this  process  greatly.  We  observe  that  the  first  D  elements  in  £x  come  from  the 

first  D  elements  of  £k-  1  and  the  first  D  elements2  of  £'K_V  Therefore,  we  can  truncate  the  length  of  £k-l 
to  D  (which  also  limits  the  length  of  £'K_1  to  D).  By  induction,  we  conclude  that  every  sequence  £0,  £1, 
. . . ,  £k  can  be  shortened  to  length  D.  Considering  the  involved  computational  complexity,  we  need  up  to 
D  additions  to  calculate  </>(-,  a)  and  another  D  comparisons  to  merge  £p  and  £p  in  each  step.  Therefore,  the 
computational  complexity  is  O(DK). 

In  summary,  the  proposed  multiuser  detection  system  in  Fig.  7.1  operates  as  follows: 

(i)  Apply  the  linear  operator/filter  L  to  the  matched-filter  bank  output  y  to  obtain  the  initial  decision  b(0) 
and  the  reliability  measures  {ay}  in  (7.6). 

(ii)  Sort  {ay}  in  increasing  order. 

(iii)  Initialize  p  <—  0,  £q  —  {e(1)  =  [0  0  •  •  •  0]},  <f>  (e^,  a)  =  0. 

(iv)  Create  £'p  =  <  e'W  =  +  [0  ■■  ■  0  1  0  •••  0]  :  G  £p  >.  Calculate  0  (e'W,a)  =  (j)  (e^\a)  + 

V  ^p+1  1  1  ) 

1  • 

(v)  Merge  £p  and  £p  to  form  £p+i  in  ascending  order  under  key  </>(■,  a).  Truncate  the  length  of  £p+i  to  D 
if  longer  than  D. 

(vi)  If  p  K  —  1,  then  set  p  i —  p  - 1~1  cind  go  b^ck  to  Step  4.  Else  go  to  Step  7. 

(vii)  For  d  =  1,  2,  ...,  D  apply  e1-^  to  the  initial  decision  b(0)  as  in  (7.7).  Produce  the  final  decision 
bNear-ML  by  (7.3). 

2In  fact,  only  the  first  |_yj  elements  of  £'k~i  can  contribute  to  the  first  D  elements  of  £k  but  this  does  not  affect 
our  conclusion  on  the  rough  order  of  the  involved  computational  complexity  and  may  complicate  our  presentation 
unnecessarily. 
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The  overall  system  computational  complexity  can  be  calculated  as  follows.  The  complexity  of  apply¬ 
ing  the  linear  operator  L  to  the  matched- filter  output  y  is  0(/\2).  Sorting  the  reliability  values  costs 

O  (_fflog2  K).  As  stated  earlier,  the  complexity  of  generating  the  ordered  sequence  {e(d)}d  is  0(DK). 
Finally,  fi(-,  •)  is  calculated  D  times  with  complexity,  again,  O(DK).  Therefore,  the  complexity  of  the  whole 


scheme  is  approximately  O^K(K 


2D  +  log2  K 


)  j  per  bit  period.  The  cost  of  the  “Near- ML  Search  & 


Decide”  block  in  Fig.  7.1  is  barely  more  than  twice  the  cost  of  the  Ly  operation  if  D  is  chosen  equal  to  K 
(that  is,  if  D  is  equal  to  the  number  of  user  bits  to  be  detected).  As  we  will  see  later  in  Section  7.4,  when 
the  linear  operator  L  is  chosen  appropriately  a  value  of  D  equal  to  K  or  -more  general-  a  few  times  K  can 
be  sufficient  for  the  algorithm  to  maintain  near-ML  performance  for  all  practical  purposes. 


7.3.3  Selection  of  the  Linear  Operator 

So  far,  we  have  intentionally  treated  the  linear  operator  L  as  arbitrary.  Yet,  it  is  important  to  choose  an 
operator/filter  that  produces  initial  decisions  with  low  probability  of  error  such  that  only  a  small  perturbation 
is  needed  to  approach  the  ML  optimum  decision  vector.  Conversely,  if  the  initial  decisions  are  poor,  then  to 
achieve  the  same  bit-error-rate  performance  level  we  will  need  a  larger  search  length  parameter  D  and  that 
will  thereby  increase  the  computational  complexity. 

Besides  the  initial  performance  considerations,  the  condition  that  the  variance  of  the  MAI  term  Zj  = 
l  wjibi  in  (7-5)  is  relatively  small  should  also  be  fulfilled.  This  directly  affects  the  accuracy  of  the 
Gaussian  approximation  of  the  soft  output  otj  which  in  turn  affects  the  quality  of  the  reliability  measure  ay- 
in  (7.6).  There  are  two  well- researched  linear  filters  that  can  satisfy  the  quality  of  initial  decision  and  small 
MAI  variance  requirements:  the  decorrelating  and  MMSE  filters.  The  decorrelating  filter  LDEC  =  R-1, 
leading  to  zero  errors  in  the  absence  of  noise,  has  perfect  near-far  resistance  and  is  the  maximum  likelihood 
solution  for  real- valued  “symbol”  vectors.  If  we  use  LDEC  =  R-1  in  (7.4),  (7.5),  we  obtain  Wjj  =  E jj, 
a^lj  =  <72(R-1)jj,  and  u2  =  0.  Because  MAI  is  completely  cancelled,  the  Gaussian  approximation  is  not 
needed  anymore.  The  reliability  measures  in  (7.6)  take  the  form 

=  (R-1)..I^I’  3  =  2’  •  •  •  ’  A'’  (7-13) 


with  no  approximation.  The  MMSE  multiuser  filter  LMMse  =  (R  +  <t2E  2)  1  minimizes  the  mean  norm 
square  error  between  its  output  and  the  true  transmitted  information  bit  vector  and  maximizes  the  output 


SINR  per  user.  However,  the  filtered  MAI  term  in  (7.5)  is  not  zero:  z3 


=  yK 

1, 


Ai  w 


bi  0.  Still,  the 


MMSE  filtered  MAI  is  approximately  Gaussian  [14], [15]  and  the  reliability  measure  in  (7.6)  maintains  its 
accuracy. 

As  a  final  comment,  it  is  clear  that  technically  the  soft  outputs  dy,  j  =  1,  2,  . . . ,  K,  are  not  independent 
and  their  degree  of  dependency  can  affect  the  quality  of  the  log-likelihood  ratio  approximation  in  (7.8)  and,  in 
turn,  the  quality  of  the  generated  sequence  of  error  patterns.  When  the  linear  operator  L  is  the  decorrelator, 

EjojOj}  _ 


}E{aj 2} 

is  less  than  10-2  and 


it  is  easy  to  calculate  the  correlation  coefficient  pij  between  d?j  and  dj,  i  ^  j :  pij 
/  omd  L'3  cat-o  where  SNR^  =  In  most  cases  of  practical  interest,  p,;, 

X\,i  -\-\jh  )  (oJN  -EVj  -\-hj  j  ) 

the  independence  approximation  is  well  justified.  In  such  cases,  we  found  (see  for  example  Section  7.4) 
that  D  =  K  is  sufficient  to  approximate  very  closely  the  performance  of  the  true  ML  detector.  In  extreme 
situations,  such  as  pre-detection  SNR  values  less  than  OdB  and  high  signature  cross-correlations  (greater 
than  0.5),  pij  can  be  greater  than  0.1  and  thus  non-negligible.  In  these  cases,  a  greater  D  should  be  used  to 
approach  ML  performance.  For  MMSE  preprocessors,  we  can  calculate  pij  numerically  and,  again,  adjust 
D  accordingly. 


7.4  Simulation  Studies 

We  consider  the  DS/CDMA  signal  model  of  Section  7.2  with  spreading  gain  31  and  K  =  15  users  with  Gold 
signature  code  assignments.  In  Fig.  7.2,  we  plot  the  bit-error-rate  (BER)  of  the  user  of  interest  (user  1) 
under  different  multiuser  detection  schemes  as  a  function  of  its  SNR  over  the  4  —  16dB  range.  The  SNRs 
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of  the  14  interferers  are  distributed  in  the  8.5  —  15dB  range  with  a  0.5dB  increment.  We  use  the  multiuser 
MMSE  filter  as  the  pre-processor  in  our  proposed  system  and  we  fix  the  length  of  the  error  pattern  sequence 
to  D  =  I\  =  15  (hence,  the  computational  complexity  per  bit  interval  is  O(DK)  =  0(K2)).  For  purposes  of 
comparison,  we  include  (i)  the  ML  optimum  multiuser  detector  [1]  (which  serves  as  a  BER  lower  bound),  (ii) 
the  stand-alone  multiuser  MMSE  detector3  [3]- [6],  (Hi)  the  “greedy”  multiuser  detector  in  [8]  with  different 
memory  parameter  settings  L  —  1,2,3  and  computational  complexity  per  bit  interval  0(LK2),  (iv)  the 
“MK-face”  detector  [9]  with  parameter  setting  MK  =  2- 15  =  30,  computational  complexity  per  bit  interval 
0(MK2)  and  space  complexity  0(2K MK),  and  (v)  the  “gradient  search”  detector  [10]  that  coincides  with 
the  “K-face”  algorithm  of  [9]  and  has  computational  complexity  0(K2)  per  iteration.  We  observe  that  in  the 
relatively  low  SNR  region  (4  —  12dB)  the  greedy,  the  MK-face,  and  the  proposed  detector  have  performance 
very  close  to  the  optimum.  We  recall  that  the  greedy  algorithm  decides  the  value  of  the  user  information 
bits  sequentially  and  the  order  favors  high  energy  bits/users.  As  the  SNR  of  the  “user  of  interest”  increases 
relative  to  the  other  users,  bit  decisions  are  made  early  with  the  adverse  effect  of  an  increase  in  BER  as  seen 
in  Fig.  7.2.  For  the  MK-face  detector,  the  pre-calculated  neighboring  candidates  fail  to  perturb  high  SNR 
user  bits  and  the  performance  curve  degrades  to  the  pre-detector  (MMSE  in  this  case).  The  proposed  system 
maintains  “near-ML”  performance  throughout  the  range  of  this  study.  In  Fig.  7.3,  we  repeat  the  study  of 
Fig.  7.2  and  we  examine  the  BER  of  the  proposed  detector  as  a  function  of  the  length  of  the  error  pattern 
sequence  D  at  SNRi  =  12dB  and  14dB.  The  proposed  algorithm  approaches  effectively  the  ML  bound  for 
modest  values  of  D  (the  number  of  bits  to  be  detected  is  K  =  15;  D  =  1  corresponds  to  the  stand-alone 
MMSE  detector  and  D  =  215  corresponds  to  the  true  ML  detector). 

Finally,  in  Fig.  7.4  we  increase  the  number  of  active  users  from  K  =  15  to  K  =  25.  The  SNRs  of  the 
24  interferers  are  distributed  over  the  8.5  —  20dB  range  with  a  0.5dB  increment.  The  performance  gain  of 
the  proposed  “near-ML”  algorithm  with  D  =  4A'  =  100  over  the  MMSE  detector  is  remarkable.  True  ML 
detection  has  complexity  proportional  to  225  and  we  cannot  afford  this  computational  cost  at  present.  For 
this  reason,  we  replace  the  true  ML  bound  that  appears  in  the  previous  Figs.  7.2  and  7.3  by  the  single-user 
bound.  We  also  cannot  afford  execution  of  the  “MK-face”  algorithm  that  requires  exponential  (225MK) 
storage  space.  Instead,  we  present  in  Fig.  7.4  the  performance  curves  of  the  “greedy”  ( L  =  4)  and  “gradient 
search”  (“K-face”)  algorithms  [8]- [10] . 

7.5  Conclusions 

We  described  a  new  multiuser  detection  algorithm.  A  decorrelating  or  MMSE  multiuser  filter  is  used  as  a 
pre-processor  that  provides  initial  decisions  and  reliability  measurements  based  on  which  an  ordered  error 
pattern  sequence  of  variable  length  is  formed.  The  error  pattern  sequence  is  followed  to  its  end  and  the 
most  likely  bit  vector  among  all  visited  options  is  returned.  When  the  length  of  the  pattern  sequence  is 
of  the  order  of  the  number  of  bits  to  be  detected,  the  additional  imposed  computational  cost  compared 
to  straight  decorrelating  or  MMSE  detection  is  rather  insignificant.  Still,  in  extensive  simulation  studies 
for  both  synchronous  and  asynchronous  links  (not  reported  herein)  we  saw  that  the  proposed  multiuser 
detection  algorithm  is  able  to  maintain  near-ML  bit-error-rate  performance  over  the  whole  studied  SNR 
range  of  interest.  There  is  strong  resemblance  between  this  scheme  and  “efficient”  decoding  algorithms  for 
binary  linear  block  codes  [11,  17]. 


3  In  this  study  we  choose  a  Gold  code  assignment  that  makes  the  normalized  signature  cross-correlation  between 
user  1  and  the  other  users  equal  to  A-  or  —A.  Notice  the  remarkable  performance  difference  between  the  MMSE 
and  the  ML  detector. 


98 


Fig.  7.1.  The  proposed  “near-ML”  multiuser  detection  architecture. 


4  6  8  10  12  14  16 

SNR^dB) 


Fig.  7.2.  Bit-error-rate  versus  SNR  for  various  multiuser  detectors  ( K  =  15,  SNR;  in  8.5  to  15dB 
range,  i  =  2,  3,  •  •  •  ,  15). 
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Fig.  7.3.  Bit-error-rate  versus  length  of  the  error  pattern  sequence  D  for  the  system  of  Fig.  7.2 
(detection  of  K  =  15  user  bits). 


Fig.  7.4.  Bit-error-rate  versus  SNR  performance  in  the  presence  of  severe  MAI  ( K  =  25,  SNR,  in 
8.5  to  20dB  range,  i  =  2,  3,  •  •  •  ,  25). 
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Chapter  8 


Rapid  Combined 
Synchronization/Demodulation 
Structures  for  DS/CDMA  Systems, 
Part  I:  Algorithmic  Developments 

8.1  Introduction 

The  exponential  complexity  of  the  optimum  direct-sequence  code-division- multiple-access  (DS/CDMA)  de¬ 
modulator  [1]  in  conjunction  with  the  severe  performance  degradation  of  the  conventional  matched-filter 
(MF)  receiver  in  the  presence  of  high  power  multiple  access  interferers  (MAI)  triggered  a  substantial  research 
effort  toward  the  development  of  demodulators  with  reasonable  complexity  that  provide  reliable  decisions  on 
the  transmitted  information  bits.  As  a  result,  many  suboptimal  schemes  have  appeared  in  the  literature  such 
as  the  decorrelating  demodulator  [2], [3]  which  is  the  zero-forcing  solution  for  multiuser  interference  rejection 
in  noiseless  channels,  multistage  architectures  [4], [5],  and  decision  feedback  detectors  [6].  Supervised  back- 
propagation  [7], [8],  and  unsupervised  neural  network  structures  [9],  minimum- mean-square-error  (MMSE) 
[10]- [13] ,  minimum-variance-distortionless-response  (MVDR)  [14],  and  auxiliary-vector  receivers  [15] - [19]  are 
additional  examples. 

Similarly  to  demodulation  algorithms,  synchronization  schemes  can  be  classified  into  supervised  [20]- [27] 
or  unsupervised  (blind)  [28] - [32]  as  well  as  multi-user  [28],  [29],  [32]  or  single-user  techniques  [20]- [28] , [31] - 
[33].  In  particular,  the  successful  operation  of  supervised  synchronization  algorithms  relies  on  the  periodic 
transmission  of  a  known  training  sequence  of  information  bits  dedicated  to  the  user  of  interest.  This  is  done 
at  the  expense  of  reduced  channel  utilization  relative  to  unsupervised  techniques  where  no  such  training 
takes  place.  In  addition,  when  supervised  synchronization  algorithms  allow  the  active  users  to  use  correlated 
training  sequences  (for  example  the  “all-1”  sequence  for  all  users)  then  a  significant  amount  of  coordination  is 
required  to  ensure  that  no  more  than  one  user  is  in  “training”  mode  (otherwise  the  receiver  is  trained  for  the 
correlated  information  bit  transmissions  of  all  users  in  “training”  mode).  Alternatively,  supervised  synch¬ 
ronization  using  pseudo-random  training  sequences  requires  that  symbol-level  synchronization  has  already 
been  achieved  before  the  corresponding  training  takes  place.  On  the  other  hand,  multi-user  synchronization 
algorithms  require  knowledge  of  the  signatures  of  all  active  users  while  single-user  techniques  assume  known 
only  the  signature  of  the  desired  user  (and  possibly  the  number  of  the  active  users  in  the  system  [24], [28], [33]). 
Just  like  their  demodulator  counterparts,  multi-user  synchronization  techniques  may  be  employed  only  at 
the  base  station  where  the  required  information  is  available,  while  single-user  techniques  can  be  employed  at 
either  end  of  the  communication  link.  As  a  final  remark,  we  note  that  channel  estimation  algorithms  [33]- [37] 
implicitly  estimate  the  timing  of  the  desired  transmission  and,  thus,  may  be  used  in  place  of  a  dedicated 
synchronization  algorithm. 

We  recall  that  timing  information  pertinent  to  the  desired  user(s)  is  the  underlying  assumption  of  all 
demodulator  proposals  in  [1]- [19] .  So,  in  practice,  demodulators  must  be  preceded  by  a  synchronizer  [38]. 
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Another  approach  is  to  integrate  the  demodulator  into  a  reliable  synchronization  scheme.  In  the  first  case 
the  receiver  design  is  a  two-stage  process,  while  in  the  second  case  only  one  combined  stage  is  involved  that 
integrates  both  timing  estimation  and  demodulation  of  the  desired  user  signal.  In  this  context,  the  synch¬ 
ronization  algorithms  developed  in  [20] -[32]  are  demodulator-independent  and  as  such  they  can  precede  any 
structure  proposed  in  [1]- [19] .  However,  conventional  matched- filter- type  coarse  synchronization  techniques 
[20]  are  found  to  be  interference- limited  at  least  as  much  as  their  demodulator  counterparts  [21]  and  as  such 
they  are  unable  to  provide  reliable  timing  estimates  in  a  multiuser  environment.  On  the  other  hand,  the 
synchronization  schemes  developed  in  [24]- [41]  exhibit  the  desired  near-far  resistance  that  the  conventional 
approaches  fail  to  provide  but  they  do  so  at  the  expense  of  significant  increase  in  computational  complexity. 
A  viable  alternative  that  balances  well  computational  complexity,  system  simplicity,  and  MAI  resistance 
is  the  combined  synchronization  and  demodulation  approach  where  both  timing  estimation  and  demodula¬ 
tion  are  performed  at  the  same  time.  We  note  that  the  combined  approach  should  not  be  confused  with 
joint  timing  estimation  and  information  bit  detection  strategies  as  in  [39] -[41].  Combined  synchronization 
and  demodulation  proposals  include  the  work  in  [42], [43]  where  an  MVDR/MMSE  sample-matrix- inversion 
(SMI)-type  receiver  is  developed  of  order  (length)  equal  to  twice  the  system  processing  gain  (to  ensure 
that  the  receiver  input  vector  always  contains  a  complete  information  symbol  period).  However,  as  we  shall 
demonstrate,  the  choice  of  a  high  order  filter  combined  with  adaptive  SMI  implementations  degrades  severely 
the  performance  of  the  algorithms  under  small  data  support  scenarios. 

In  this  chapter  we  focus  on  rapid  synchronization  and  combined  demodulation  based  on  adaptive  short 
data  record  operations  that  enables  tracking  of  rapidly  changing  communication  environments.  We  propose 
two  blind  single-user  combined  synchronization/demodulation  algorithms  that  result  in  linear  structures 
of  order  L,  where  L  is  the  system  processing  gain.  First,  we  develop  an  MVDR-type  L-order  algorithm. 
Then,  motivated  by  the  limitations  of  the  MVDR  structure  under  short-data-record  operation,  we  design 
an  auxiliary-vector  (AV)-type  L-order  scheme.  Simulation  studies  demonstrate  the  effects  of  the  choice  of 
the  filter  order  and  the  choice  of  the  design  algorithm  on  coarse  synchronization  and  bit-error-rate  (BER) 
receiver  performance. 

The  chapter  is  organized  as  follows.  In  Section  8.2  we  present  the  signal  model.  Design  preliminaries 
are  covered  in  Section  8.3.  In  Section  8.4  we  develop  the  proposed  combined  synchronization/demodulation 
algorithms.  Implementation  issues  are  discussed  in  Section  8.5.  Numerical  and  simulation  comparisons  are 
included  in  Section  8.6  and  some  conclusions  are  drawn  in  Section  8.7. 


8.2  Signal  Model 

We  consider  an  asynchronous  DS/CDMA  system  populated  by  K  active  users  transmitting  over  a  common 
additive  white  Gaussian  noise  (AWGN)  channel.  The  received  signal  r(t)  is  the  superposition  of  the  K 
transmissions  corrupted  by  channel  noise,  i.e. 

K- 1 

r(t)  =  Y  rk(t  -  rk )  +  v{t).  (8.1) 

k= 0 

In  the  above  expression,  Tk  is  the  delay  of  the  kih  user  relative  to  the  receiver’s  time  reference  point  and 
v(t)  is  AWGN.  The  fcth  user’s  contribution  to  the  received  signal  r(t)  is  given  by 

OO 

rk{t)=  Y  y/Ekbk{i)Sk(t-iT),  k  =  0,...,K-l,  (8.2) 

i=  —  oo 

where  bk(i)  £  {  —  1,  +1}  is  the  Ah  information  bit,  Ek  is  transmitted  energy,  and  T  is  the  information  bit 
period.  The  signature  waveform  Sk(t)  assigned  to  the  kth.  user  has  the  form 

L—X 

Sk(t)  =  Ysk(l)pTc{t-lTc),  k  =  0, . . . ,  K  —  1,  (8.3) 

1=0 

where  Tc  is  the  chip  duration,  Ptc  is  a  rectangular  pulse  with  support  [0,  Tc],  L  =  T/Tc  is  the  system 
processing  gain,  and  sk{l)  is  the  /th  element  of  the  normalized  bipolar  signature  or  spreading  vector  sk  = 
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[sfc(O),  Sfc(l), . . . ,  sk(L  —  1)]T  that  uniquely  identifies  the  A:th  user.  We  assume  that  the  signature  waveforms 
are  normalized  to  unit  energy  over  a  bit  period,  i.e.  /QT  S^.{t)dt  =1,  A:  =  0, . . . ,  I\  —  1. 

Without  loss  of  generality,  we  let  rk  £  [0,  X1),  k  =  0,  ...,K  —  1,  in  (8.1).  Thus,  we  may  write  the 
delay  rk  as  a  sum  of  an  integer  multiple  of  chips  plus  a  fraction  of  a  chip,  i.e.  rk  =  (nk  +  5k)Tc  where 
nk  £  {0, . . . ,  L  —  1}  and  5k  £  [0, 1).  After  chip- matched  filtering  and  chip-rate  sampling,  the  continuous-time 
signal  r(t)  in  (8.1)  is  reduced  to  a  discrete-time  sequence  {r(n)}5)(L_00  where 

f(n+l)Ta 

r(n )  =  /  r(t)dt.  (8-4) 

J  nTc 

We  note  that  since  the  chip  timing  of  the  desired  user  is  unknown  to  the  receiver  the  integrate-and-dump 
operation  in  (8.4)  will  result  in  SNR  loss  due  to  the  averaging  of  adjacent  chips.  This  SNR  loss  does  not 
exceed  3dB  [24]  and  can  be  reduced  by  oversampling  the  output  of  the  chip-matched  filter  at  the  expense  of 
increased  receiver  complexity.  By  substituting  (8.1),  (8.2),  and  (8.3)  into  (8.4),  we  obtain: 


r(n) 


K-l 


y  i  yEkbi 


k= 0 


V  Ekbk 


n-rik 
L 

n  —  rik  —  1 


(1  -  5k)sk  {{n  -  rik)  T  L) 

5kSk  ((n  -  nk  ~  1)  T  L)  X  +  v(n) 


(8.5) 


where  v(n)  is  chip-matched  filtered  and  sampled  AWGN  with  variance  u2,  [a:J  denotes  the  largest  integer 
smaller  than  or  equal  to  x  and  denotes  the  modulo-L  operation. 

The  problem  we  consider  in  this  work  is  the  development  of  a  linear  receiver  for  combined  synchronization 
and  demodulation  of  the  information  bit  of  the  user  of  interest,  user  0.  The  only  known  quantity  is  the 
signature  of  the  user  of  interest  s0.  All  other  system  parameters,  such  as  energies  and  delays  of  all  users  as 
well  as  signature  vectors  of  the  interfering  users,  are  considered  unknown. 


8.3  Design  Preliminaries  and  Background 

Let 

s(nk)  a  ,Sk(0),...,sk(L-nk-l)]T  + 

rik 

5k  \0, .  .  . ,  0,  sk  (0),  ■  ■  ■ ,  sk (L  -  nk  ~  2)]T,  (8.6) 

nk+ 1 

and 

g^-)  a  (1  _  5k)[Sk(L  -  nk),..  .,sk(L  -  1),0^^0]T  + 

L—nk 

8k[sk(L-nk  -  1 sk(L-  1), 0, . . .  ,0jr,  (8.7) 

L—nk— 1 

for  k  =  0, . . . ,  K  —  1.  Then,  under  the  assumption  of  a  linear  filter  receiver  of  length  L,  the  corresponding 
Lx  1  received  input  vector  r  ^  =  [ r(iL ), . . .  ,r((L  +  1  )L  —  1)]T  £  $tL  can  be  written  as  follows: 

r i  =  y/^o  {&o(*)so”o)  +  60(*  -  l)sono_)|  + 

X!  \/%{bk(i)s[nk) +bk(i-l) S^fc“)}+iz  (8.8) 

k= 1 

where  v  is  an  AWGN  vector  with  autocorrelation  matrix  <r2I.  The  decision  on  the  bit  bo(i)  that  modulates 
s'no)  in  the  vector  r*  is  set  to  be 

b0(i)  =  sgn(wTri)  (8.9) 
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where  w  e  is  the  filter  tap-weight  vector. 

In  this  chapter,  we  develop  combined  synchronization/demodulation  algorithms  that  are  based  on  either 
the  MVDR  or  the  AV  filtering  approach.  We  recall  that  an  MVDR-type  filter  [44]  is  a  tap-weight  vector  w 
that  minimizes  the  output  variance/energy  E{|wTrj|2}  (£!{•}  denotes  the  statistical  expectation  operation) 

and  at  the  same  time  is  “distortionless”  in  a  reference  direction  d,  i.e.  wTd  =1.  If  R  =  E{vivf }  denotes 
the  autocorrelation  matrix  of  the  received  vector  ri:  then  the  MVDR  tap- weight  vector  is  given  by 


R  'd 

™d,MvnR  ~  dTR-ld' 


(8.10) 


In  the  rest  of  this  chapter,  the  first  subscript  of  the  filter  variable  w  will  identify  the  reference  vector  while 
the  second  subscript  will  identify  the  receiver  type  (MVDR  or  AV).  When  the  latter  is  not  present,  either 
type  is  applicable. 

To  obtain  the  solution  in  (8.10)  for  a  given  reference  direction  d,  it  suffices  to  decompose  w  in  the  form 
w  =  ]pjp  y  where  dTy  =  0  and  search  for  the  vector  y  that  minimizes  the  variance  at  the  output  of 
w.  Instead  of  optimizing  the  orthogonal  direction  y  as  a  whole,  we  further  decompose  the  problem  and 
optimize  with  respect  to  a  sequence  of  weighted  auxiliary  vectors  gm  and  their  corresponding  scalar  weights 
/.tm,  m  =  1,  2, ... ,  separately.  The  general  method  is  termed  auxiliary-vector  (AV)  filtering.  The  concept  of 
AV  filtering  as  introduced  in  [15]- [17]  pertains  to  a  particular  choice  for  gTO  and  the  use  of  a  conditionally 
MS-optimum,  scalar  pm  for  a  given  gm.  For  a  receiver  design  with  M  auxiliary  vectors,  the  linear  tap  weight 
vector  wd  l4v(M)  can  be  expressed  as  follows: 


Wd,AV(M) 


M 

^  ^  MmSm?  M  =  1?  2,  3,  .  .  .  . 

m=  1 


(8.11) 


In  contrast  to  [17]  that  proposes  auxiliary  vectors  gm,  m  =  1, . . . ,  M ,  orthogonal  to  one  another  (and  thus 
limits  the  maximum  number  of  auxiliary  vectors  M  to  L  —  1),  in  this  chapter  we  relax  the  latter  constraint 
and  we  follow  the  proposal  of  [18]  and  [19].  That  is,  we  select  the  auxiliary  vectors  gm  and  the  steering  scalars 
pm,  m  =  1, . ..  ,M,  recursively  through  conditional  optimization  according  to  the  following  two  criteria:  (i) 
The  ?7ith  auxiliary  vector  gm  is  set  to  maximize  the  magnitude  of  the  cross-correlation  between  its  output 
and  the  output  of  the  previous  step  filter,  wd  Av(m_1)  =  d  —  subject  to  the  constraint  that  gm  is 

orthonormal  to  the  reference  vector  d.  (ii)  The  steering  scalar  pm  is  set  to  minimize  the  MSE  between  the 
outputs  w^]j4v(m_1)rj  and  g^r*.  The  whole  algorithm  is  summarized  below  in  the  following  simple  recursive 
form  [19]: 

d 

Wd,AV(m)  =  Wd,,4V(rr*-l)  —  Itllgm!  771  =  1,  2,  ...  ,  Wd-j4V(0)  =  ||^||2  , 

_  R'WdAl'(m-l)  —  (d  hWd,AV(m-l))d 

Sm  -  (dTRwd^v(m_1))d||’ 

_  glRffdAT(m-l) 

IP  m  ff7Rg 

By  inspection,  we  observe  that  for  the  MS  optimum  value  of  pm  the  product  pmgm  is  independent  of  the 
norm  of  g,„.  Therefore,  we  may  drop  the  unnecessary  normalization  operation  in  (8.13).  Then,  the  auxiliary 
vector  generation  procedure  may  stop  when  gm+i  =  0  and  in  that  case  wd  j4v(m)  is  exactly  equal  to  w d,MVDR 
(formal  theoretical  analysis  and  convergence  proof  can  be  found  in  [19]). 

The  auxiliary-vector  filter  wd  j4v(m)  has  two  advantages  in  comparison  with  the  wdMvr,H  filter  in  (8.10). 
First,  no  matrix  inversion  operation  (neither  explicit  nor  implicit)  is  required  for  the  auxiliary- vector  filter. 
The  second  and  most  important  advantage  has  to  do  with  the  short  data  record  behavior  of  the  filter 
estimators  wd iAi,(0),wd iW(1),  wdj4v(2),  ...  and  wdMVDR  that  are  based  on  an  7V-point  estimate  of  the 
autocorrelation  matrix  R  =  r^rT1.  As  illustrated  in  [19],  for  a  fixed  finite  data-record-size  N  the 

sequence  {wd  j4v(m)}m  provides  an  infinite  number  of  filter  estimates  with  varying  bias  versus  covariance 

1The  sequence  {w d:Av(m.)}m  can  be  viewed  not  only  as  a  sequence  of  estimates  of  the  ideal  AV-filter  sequence  in 
(8.12)-(8.14)  but  also,  and  most  importantly,  as  a  sequence  of  estimates  of  the  MVDR  filter  in  (8.10). 


(8.12) 

(8.13) 

(8.14) 
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characteristics.  For  short  data  records  N ,  the  early,  non-asymptotic,  elements  of  the  generated  sequence  of 
AV  estimators  offer  favorable  bias/co  variance  balance  and  are  seen  to  outperform  significantly  in  mean-square 
estimation  error  the  w d,Mvnn  estimator.  This  translates  to  rapid  acquisition  and  superior  short  data  record 
BER  performance  as  demonstrated  in  the  following  section.  Data-record-based  criteria  for  the  selection  of 
the  best  AV- filter  in  the  sequence  have  been  recently  reported  in  [45],  [46]. 

The  use  of  the  reference  vector  d  as  described  above  in  the  context  of  MVDR  or  AV  filtering  allows  the 
design  of  DS/CDMA  filters  that  are  distortionless  in  the  vector  direction  of  the  user  signal  of  interest.  For 
example,  in  perfectly  synchronous  systems  the  vector  direction  of  the  signal  of  interest  is  So-  In  coarsely 
synchronized  systems  (no  =  0)  the  vector  direction  Sg  of  the  signal  of  interest  is  a  convex  combination  of  So 

and  s‘+1)  =  [0,  so(0), . . . ,  so(L  —  2)]T  given  by  the  following  expression: 

sS  =  (1  —  <50)[s0(0), ..  .,s0(L  —  1)]T  +  <50[0,  s0(0), . . . , s0(L  —  2)]T  (8.15) 

=  (1  —  ^o)so  +  ^oso+  (8.16) 

By  setting  the  reference  vector  direction  d  equal  to  Sq  we  guarantee  that  the  energy  of  the  desired  transmission 
is  protected  and  the  reduction  of  the  filter  output  energy  will  be  due  to  interference  and  noise  suppression 
only.  However,  in  non-synchronized  systems  the  actual  vector  direction  of  the  desired  signal  Sq"0-*  is  given 
by  (8.6)  for  some  unknown  no  ^  0  and  it  is  different  from  Sg.  In  this  case,  a  receiver  distortionless  in 
the  Sg  direction  will  treat  the  desired  transmission  as  interference  (since  it  is  no  longer  protected  by  the 

constraint  w/Isq  =  1)  and  it  will  suppress  a  portion  of  it.  The  lower  the  cross-correlation  between  Sg 

and  Sgn°^  is,  the  more  severe  the  suppression  of  the  desired  signal  can  be.  Therefore,  a  minimum  output 
variance  receiver  distortionless  in  a  direction  different  than  the  actual  vector  direction  of  interest  exhibits 
lower  output  energy  than  the  output  energy  of  the  minimum  output  variance  receiver  that  is  distortionless 
in  the  actual  direction  of  the  user  of  interest.  This  observation  forms  the  basis  of  the  proposed  combined 
synchronization/demodulation  algorithm  which  is  discussed  in  more  detail  in  the  following  section. 

8.4  Order- L  Combined  Synchronization/Demodulation 

Effective  demodulation  of  DS/CDMA  signals  requires  a  received  vector  that  contains  at  least  one  complete 
information  symbol.  Chip-rate  sampling  of  the  chip-MF  output  at  the  receiver  front-end  implies  that  when 
we  use  a  2L-long  [42]  or  longer  received  vector,  the  latter  condition  is  always  satisfied  regardless  of  the 
delay  of  the  desired  transmission.  This  requires  the  use  of  a  linear  filter  of  order  at  least  2 L.  However,  the 
longer  the  employed  filter  is,  the  higher  the  optimization  and  computational  complexity.  In  addition,  as 
we  shall  show  in  Chapter  9,  adaptive  SMI  implementations  of  receivers  employing  long  linear  filters  result 
in  severe  performance  degradation  in  small  data  support  situations.  For  these  reasons,  practical  receiver 
designs  should  be  based  on  small  filter  order  choices.  On  the  other  hand,  the  shortest  input  vector  that  is 
possible  to  contain  a  complete  information  symbol  is  of  length  L,  where  L  is  the  system  processing  gain.  This 
implies  that  the  length  of  an  effective  combined  synchronization/demodulation  structure  is  lower  limited  by 
L.  Thus,  our  first  objective  toward  the  development  of  an  effective  receiver  is  to  employ  filters  of  order 
L.  Our  second  objective  is  to  integrate  synchronization  and  demodulation  into  a  “combined”  scheme  so 
that  re-alignment  of  the  received  data  and  re-evaluation  of  a  demodulation  filter  is  no  longer  needed.  To 
have  a  receiver  readily  available  by  the  time  synchronization  is  achieved,  the  proposed  L-order  combined 
synchronization/demodulation  algorithms  consist  of  the  following  two  general  steps.  In  the  first  step,  the 
[0,  T)  timing  uncertainty  interval  is  quantized  to  a  finite  number  of  hypotheses.  For  each  hypothesis,  the 
received  samples  are  grouped  into  vectors  of  length  L  and  a  pair  of  linear  receivers  (distortionless  in  two 
different  fractional  shifts  of  the  direction  of  interest)  is  evaluated  based  on  the  statistics  of  the  corresponding 
input  vector  sequence.  For  each  distortionless  direction,  the  receiver  with  the  highest  output  energy  over  all 
hypotheses  provides  a  coarsely  synchronized  receiver.  In  the  second  step,  we  obtain  a  refined  structure  by 
maximizing  the  output  energy  of  a  linear  combination  of  the  two  receivers  that  have  already  been  evaluated 
in  the  first  step.  In  the  rest  of  the  chapter,  we  refer  to  the  first  step  of  the  proposed  algorithm  as  the 
coarse  synchronization/ demodulation  step  and  to  the  second  step  as  refined  synchronization/demodulation. 
Finally,  our  third  objective  is  to  achieve  rapid  synchronization  and  superior  BER  performance  under  short 
data  record  adaptation.  In  this  section  we  develop  two  structures:  An  L-order  MVDR-type  and  an  L-order 
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(b) 


Fig.  8.1.  Coarse  synchronization  based  on  filters  of  order  L:  (a)  Block  diagram,  (b)  Algorithmic 
description  of  the  MYDR  and  AV  filters. 


Fig.  8.2.  Construction  of  the  sequences  r^,  l  =  0, . . . ,  L  —  1. 


AV-type  linear  structure.  Both  schemes  meet  the  first  two  objectives  and  exhibit  superior  BER  performance 
when  compared  to  their  2 L  counterparts;  the  latter  scheme  is  the  most  favorable  under  short  data  record 
adaptation. 

In  the  rest  of  the  section  we  present  the  details  of  the  proposed  schemes.  Fig.  8.1  shows  the  block  diagram 
description  of  an  MVDR-type  or  AV-type  coarse  synchronizer/demodulator  that  operates  on  L-long  received 
vectors.  The  buffer  follows  a  chip-rate  sampler  and  groups  the  received  samples  into  L  sequences  of  vectors, 
one  for  each  of  the  L  possible  values  of  no,  as  depicted  in  Fig.  8.2. 

The  ith  vector  of  the  1-vector  sequence  is  given  by 

rf  =  [r(iL  +  l),r(iL  +  l  +  l),...,r({i  +  l)L  +  l-l)]T,  l  =  0,...,L-l.  (8.17) 

The  subscript  i  of  r[l\  i  =  0, . . . ,  N  —  1,  denotes  the  order  of  the  received  sample  vector  in  any  of  the  above 
sequences  in  a  data  record  of  size  N.  For  l  =  no,  the  received  vectors  r^"°\  i  =  0, 1, . . . ,  N  —  1,  contain  a 
complete  information  symbol  that  is  modulated  by  the  vector  Sq  given  by  (8.16).  To  efficiently  demodulate 
the  desired  user  transmission,  a  linear  receiver  should  operate  exactly  on  the  {r,*"0-*}  received  vector  sequence. 
Thus,  efficient  and  effective  estimation  of  no  is  needed  and  this  is  performed  implicitly  by  the  first  step  of  the 
proposed  algorithm.  The  discussion  in  the  previous  section  suggested  that  the  output  energy  of  a  receiver 
that  is  distortionless  in  the  sg  direction  is  maximized  when  the  offset  l  in  Fig.  8.2  is  equal  to  no-  In  other 
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words, 


n0  =  argmaxlwfP  R(0ws(i)  |  l  =  0,...,L-l}  (8.18) 

where 

R(0  =  E{y(-\(P  }  (8.19) 

is  the  autocorrelation  matrix  of  the  received  vectors  of  the  Zth  sequence,  l  =  0, . . . ,  L  —  1,  and  is  an  AV 
or  MVDR-type  filter  distortionless  in  Sq  and  evaluated  based  on  the  statistics  of  the  filter  input  sequence 

(rf)}  (for  example  vrpMVDR  =  R(^  1sJ/  ^s^R*^  ^J^).  Indeed,  the  following  proposition  shows  that 

(8.18)  holds  true  for  the  case  of  a  power-controlled  system  employing  an  MVDR  receiver,  provided  that  the 
SNR  of  the  received  signals  is  sufficiently  large.  The  proof  is  included  in  the  Appendix. 


Proposition  8.1  In  power- controlled  systems  and  for  sufficiently  large  user  SNRs,  the  quantity 


Air 


w',  r(Ow(P  /  —  0  L  —  1 


(8.20) 


is  maximized  for  l  =  Uq,  where  wp 


R(I> 


si., MVDR 


R<!>- 


□ 


Therefore,  to  achieve  coarse  synchronization  with  the  desired  transmission,  it  suffices  to  find  the  value  of 
l  that  maximizes  the  output  energy  of  a  receiver  distortionless  in  the  direction  of  the  vector  Sq.  To  overcome 
the  fact  that  at  this  stage  the  parameter  do  of  the  effective  signature  Sq  is  unknown,  we  use  two  normalized 
reference  vectors  do  and  di  given  by  (8.16)  for  do  =  0  and  do  =  0.5,  respectively,  i.e. 

do  =  so,  (8.21) 

dx  =  (0.5s0  +  0.5sq+1))/||0.5s0  +  0.5sq+1)  ||.  (8.22) 

The  coarse  synchronization  algorithm  is  summarized  below. 


L-order  coarse  synchronization  algorithm 

(i)  For  l  M  0, . . ,  L  —  1,  calculate  the  Zth  autocorrelation  matrix 

r(')  =  E{  rf  )rf )T}  (8.23) 

where  the  vector  rP  is  given  by  (8.17). 

(ii)  For  each  autocorrelation  matrix  calculate  the  AV  (or  MVDR)  filters  wpt  and  wp  that  are  distortionless 
in  the  d0  and  dx  vector  direction,  respectively. 

(iii)  Calculate  the  output  energy  of  the  filters: 

OEW  =  wPJr^wP  and  0£(2i+1>  =  w^R^w^,  ?  =  0,...,L-1.  (8.24) 


(iv)  Let 

jmax  =  argmaxIOF^Ij  =  0, . . . ,  2L  -  1}. 
j 


(8.25) 


Then, 


RO  —  \_jmax /2J  • 


(8.26) 


The  effective  signature  in  is  the  vector  Sq. 

Receiver  refinement  can  be  performed  as  follows.  In  the  case  of  the  MVDR  receiver,  the  ideal  (perfectly 
refined)  receiver  is  given  by 


R(ft°r 


,(<M 


n0||  <50). MVDR 


|(do)R("o)-1s*|!(d0) 


(8.27) 
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where  Sm0,,(<5o)  is  the  normalized  effective  signature 


,  A  (1  -  5)  Sp  +  fep+1) 
||(l-^)s0  +  ^+1)| 


(8.28) 


evaluated  at  S  =  Sq.  Since  (1— ^o)so+^oso+1'*  =  (1  —  2^o)do+2^o||0.5so+0.5sg+1)  ||di,  the  filter  in  (8.27)  can  be 
expressed  as  a  linear  combination  of  the  two  already  evaluated  MVDR  filters  wj,1 1°mvdr  and  wd"°Mvr>ii  (or 
as  a  linear  combination  of  w^”"^ D R  and  w when  jmax  is  even  and  OE(jmax- 1)  >  OE(jmax  + 1)). 

Based  on  the  above  observation  we  may  refine  our  initial  filter  estimates  Wj )^mvdr  and  wd i^mvdr  by 
linearly  combining  them  to  produce  a  filter  w mvdr  with  maximum  output  energy.  The  combination  rule  is 
given  by  the  following  proposition.  The  proof  is  included  in  the  Appendix. 


Proposition  8.2  (i)  Let  W  be  the  set  of  linear  combinations  ofw^l°\iVDR  and  'w'a°mvdr  °f  form 


,(«o) 


cco(^)'' 


'do,. MVDR 


ai(<5)l 


'di  ,MVDR 


(ao(<^ 


,(«o) 

'd0,  MVDR 


au(^)’"'<rt°'* 


di  ,MVDR 


with 


ai(<S)  = 


Then,  the  refined  filter  w, 


a0{5)  =  (l-2S)s^R^~1So, 

2^(0. 5s0  +  0.5sQ+1))TR(”°)_i(0.5s0 


0.5s 


(+i) 


).5Sn  +  0.5s, 


(+l)l 


,(<5) 


,  and  S  £  [0, 1). 


j  is  the  solution  to  the  following  optimization  problem: 

w mvdr  =  arg  max  wTR("o)w. 
wGW 

(ii)  Equivalently,  the  filter  w MVDR  is  given  by 

~  _  ao(^max)wd0,\lVDR  +  al(dmax)'wd1JMVDR 


(fxo(dmax)^d0,MVDR  +  al(^mai)wdi,WDfi)  S|j0||(^max) 
where  Smax  is  the  value  of  S  £  [0,1)  that  maximizes  the  output  energy  of  an  MVDR  receiver,  i.e. 


Smax  =  arg  max 

<5e[o,i)  L 


(8.29) 

(8.30) 

(8.31) 

(8.32) 

(8.33) 


(8.34) 

□ 


The  case  of  the  AV  receiver  is  treated  similarly.  To  obtain  a  combined  synchronization/demodulation 
scheme,  we  form  (as  in  the  MVDR  case)  the  final  refined  receiver  ww,  as  a  linear  combination  of  two 
coarsely  synchronized  AV  receivers,  namely  Wj*0^  and  that  were  already  obtained  during  the 

coarse  synchronization  process  (or  as  a  linear  combination  of  w and  when  jmax  is  even  and 

OE(j 

max  1)  >  OE(j 

max  +  l)),i.e. 

wAV  =  a0(S)w  +  an(S)w^v.  (8.35) 

To  ensure  that  wAV  is  of  the  general  form 

™av  =  s|0||(^)  +  y  (8-36) 

for  some  y  orthogonal  to  sj*j0||(<5),  the  scalar  weights  ao(S)  and  ai((5)  in  (8.35)  are  first  set  equal  to 

a0(S)  =  (1  —  2(5)  / 1 1  ( 1  —  5)  s0  +  (5sq+1)||  and  (8.37) 
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(8.38) 


£*i(<S)  —  2<5 1 1 0.5so  +  0.5sq+  ''||/||(1  —  <5)so  +  <5sq+  ^ 

If  we  combine  (8.35),  (8.37),  and  (8.38)  we  obtain 

w  w  =  s||o||(5)  +  a0(^)yo  +  ai(5)yi  (8.39) 

where  yo  =  —  Sti  Mdo,igd0,*  and  yi  =  —  HiLi  Mdi.igdi.i  (the  subscripts  do  and  di  of  /.q  and  g;  indicate 
the  corresponding  AV  filters  obtained  during  coarse  synchronization,  and  wd?iy  respectively). 

We  observe  that  (8.39)  is  not  necessarily  of  the  form  of  (8.36)  since  the  vector  ao(^)yo  +  ou(5)yi  is  not 
necessarily  orthogonal  to  sj*j0||(<5).  All  we  need  is  to  use  the  orthogonal  projections  yfJ  and  y^  onto  the 
subspace  spanned  by  SJ|0||  (^)  f°r  ^  S  [0, 1)  instead  of  yo  and  yi,  respectively.  Then,  y  =  ao(^)yg“  +  &i{S)y:t 
is  orthogonal  to  s^y  (5)  for  any  S  G  [0, 1)  and  does  qualify  as  the  vector  y  in  (8.36).  We  note  that  the  vector 
set  {s|0M(5)  |  i  £  [0, 1)}  has  dimension  2,  therefore  y  is  given  as  follows: 

y  =  a0(<S)(I  -  u0uj  -  uiuf)y0  +  ai(5)(I  -  u0uj  -  uiuf)yi,  (8.40) 

where  {uo,Ui}  is  an  orthonormal  basis  of  the  space  spanned  by  {so,Sq+1^}.  Combining  (8.39)  and  (8.40)  we 
obtain  an  expression  for  the  AV-type  receiver  as  a  function  of  5: 

ws.o||(5),^v  =  S|j0||(<5)  +  a0(<S)(I  -  u0uj  -  uiuf)y0  +  ai(<5)(I  -  u0uj  -  uiuf)yi.  (8.41) 

Finally,  the  combined  synchronization/demodulation  AV-type  receiver  is  given  by  w„.  (smaa,),Av  where 

Smax  =  arg  max  E(S)  =  arg  max  {wJ,o||(5)  AyR(”o)ws,o||(S)ij4v}.  (8.42) 

In  practice,  the  autocorrelation  matrix  R(d,  l  =  0, . . . ,  L  —  1,  is  sample-average  estimated: 

=  (8-43) 

i=  0 

where  N  is  the  data  record  size.  For  values  of  N  less  than  the  filter  order  L  the  estimate  in  (8.43)  is  not 
full-rank,  thus  the  sample  matrix  inversion  (SMI)  estimate  of  the  MVDR  filter  does  not  exist. 


8.5  Implementation  Issues 


Evaluation  of  the  sample  average  estimate  of  the  input  autocorrelation  matrices  R(d,  l  =  0, . . . ,  L  —  1,  consti¬ 
tutes  a  major  part  of  the  computational  burden  of  both  proposed  algorithms.  Straightforward  application  of 
(8.43)  involves  a  total  number  of  floating  point  operations  of  order  0(L3).  However,  the  special  structure  of 
the  matrix  R(d  allows  evaluation  with  a  total  number  of  floating  point  operations  of  order  0(L2).  This  can 
be  achieved  by  exploiting  the  “overlapping”  property  of  the  autocorrelation  matrix.  Indeed,  we  may  observe 

that  the  matrices  R(d  are  submatrices  of  R  =  -^  Xlili  ,  where  the  2L-long  vectors  r*,  *  =  0, IV  —  1, 
are  formed  by  a  sliding  window  such  that  successive  vectors  share  L  common  data  samples: 


ri  =  [r[iL\,r[iL+  l],...,r[(i  +  2)L-  1]]T  , 

=  H(*  +  l)L],r[(i  +  1)L+  1],..  .,r[(i  +  3)L  -  1]]T  . 


More  precisely, 


(  R(l,l ) 


R(0  = 


R(l>,  l  +  L  —  1)  \ 


,  l  —  0, . . . ,  L  —  1, 


\  R(l  +  L  —  1,  l)  ...  R(l  -\-  L  —  1,  l  -\-  L  —  1)  / 


(8.44) 

(8.45) 


(8.46) 
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Table  8.1.  Computational  Complexity  of  Covariance  Matrix  Estimation 


L- 

order 

Additions: 

~JL{L 

+ 

1)(N- 

-1) 

Multiplications: 

1  L(L 

+ 

1  )N 

2  L 

-order 

Additions: 

L(2L 

T 

1  )(N~ 

-1) 

Multiplications: 

L(2L 

+ 

1  )N 

where  R(i,  j),  i  =  0, . . . ,  2L—  1,  j  =  0, 2L  —  1,  denotes  the  (i,j)th  element  of  R.  In  Table  8.1  we  show  the 
total  number  of  multiplications  and  additions  needed  for  the  evaluation  of  RT\  l  =  0,.» , . ,  L  —  1,  (proposed 
L-order  algorithms)  and  R  (2L-order  algorithms  [42])  based  on  a  data  record  of  size  N. 

On  the  other  hand,  the  evaluation  of  the  inverse  estimates  [rTJ  required  by  the  SMI-MVDR-type 

algorithm  involves  computations  of  order  0(NL3).  Inverse  matrix  estimation  can  be  achieved  recursively 
described  in  the  following  proposition.  The  proof  is  included  in  the  Appendix. 


Proposition  8.3  Let  the  matrix  R*,+1)  be  partitioned  as  follows: 

R('+1)  =  (  J?  b'+1 

V  °;+i  ai+i 

where  C  is  an  (L  —  1)  x  (L  —  1)  matrix,  b;+i  is  an  (L  —  1)  x  1  vector,  and  ai+\  is  a  scalar.  Then 


R(,+1)  1  = 


C 


-l 


C-1b,+1bj‘+1C 


ai  +  l—  bzI^1C  1b;  +  i 

(br+1C-1bl+1-aI+1)-1bf+1C-* 


(b/, , C  'b/fl  r/,,,)  'C  'b/tl 


(ai+1  -  br+1c-%+1) 


The  matrix  C  1  is  given  by 


c-1  =  T  -  — /3/3t 
a 


where  the  scalar  a,  the  vector  and  the  matrix  r(L-i)x(n-i)  form  the  partition  o/R(^ 

R(0_1  _  (  01  @ 

1  p  r 


(8.47) 

(8.48) 

(8.49) 

(8.50) 

□ 


In  Table  8.2  we  show  the  total  number  of  additions,  multiplications,  and  divisions  needed  by  the  proposed 
L-order  coarse  synchronization  algorithms  (SMI-MVDR  and  AV-type)  and  the  2L-order  SMI-MVDR-type 
coarse  synchronization  algorithm  [42].  The  AV  scheme  utilizes  M  auxiliary  vectors.  It  is  interesting  to  note 
that  the  sum  of  the  total  number  of  floating  point  multiplications  and  divisions  required  by  the  L-order 
SMI-MVDR  algorithm  is  at  least  45%  less  than  the  corresponding  number  required  by  the  2L-order  SMI- 
MVDR  algorithm.  At  the  same  time  the  L-order  algorithms  outperform  the  2L-order  algorithms  in  coarse 
synchronization  error  and  bit-error-rate,  as  we  will  see  in  the  next  Section. 


8.6  Simulation  Results  and  Comparisons 

We  consider  a  11-user  asynchronous  DS-CDMA  system  that  utilizes  Gold  sequences  of  length  L  =  31.  We 
compare  the  proposed  L-order  coarse  synchronization  algorithms  with  the  2L-order  scheme  in  [42].  We  also 
develop  the  2L-order  AV-type  algorithm  that  we  include  together  with  the  L  and  2L-order  MF-type  receivers 
as  reference  points.  The  probability  of  coarse  synchronization  error  or  “coarse-synchronization-error-rate” 
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Table  8.2.  Computational  Complexity  of  Coarse  Synchronization  Algorithms 


L-order 

SMI-MVDR 

Additions:  (2 L  -  1  )(L  -  l)2  +  J{L  -  1  )L(L  -  1/2)  +  J(L  -  1  )L  +  2 L(L2  -  1) 

Multiplications:  2  (L  -  1  )2(L  +  1)  +  %L(L  -  1  )(L  -  1/2)  +  |L(L  -  1)  +  2  L2(L  +  1) 

Divisions:  5  L  —  2 

AY 

Additions:  2L(L2  —  1)  +  2ML[6L  —  3  +  L(L  —  1)] 

Multiplications:  (12M  +  2)L 2  +  2(M  +  1)L3 
Divisions:  2  ML 

2  L-order 

SMI-MVDR 

Additions:  |L(2L  —  1)(2L  —  1/2)  +  |L(2L  —  1)  +  2L3 

Multiplications:  ^L(2L  —  1)(2L  —  1/2)  +  ®L(2L  —  1)  +  2  L3  +  2L(L  +  1) 

Divisions:  4L 


(CSER)  is  defined  as  the  probability  that  the  delay  estimate  provided  by  the  coarse  synchronization  algorithm 
is  off  by  more  than  0.5TC  from  the  actual  delay.  The  coarse-synchronization-error-rate  is  evaluated  by 
averaging  over  5,000  independent  simulation  experiments.  The  delays  of  all  users  are  chosen  randomly  and  are 
kept  constant  over  5  independent  experiments.  The  autocorrelation  matrix  of  the  received  signal  is  estimated 
by  sample  averaging  for  both  the  SMI-MVDR  and  the  AV  scheme.  Finally,  the  number  of  auxiliary  vectors 
utilized  by  the  AV  receiver  is  optimized  (“genie”  assisted)  with  respect  to  the  corresponding  performance 
metric  (CSER  or  BER).  Data  record  based  criteria  for  the  selection  of  the  best  number  of  auxiliary  vectors 
(i.e.  best  AV  filter  in  the  sequence)  have  been  recently  reported  in  [45],  [46]. 

In  Fig.  8.3  the  coarse-synchronization-error-rate  performance  is  depicted  as  the  data  support  size  ranges 
from  15  to  200  samples.  The  SNRs2  of  the  interferers  are  fixed  at  10,  12,  14,  16,  18,  20,  22,  24,  26,  28dB. 
The  SNR  of  the  user  of  interest  is  fixed  at  22dB.  We  observe  that  the  AV  order- L  and  the  AV  order-2L 
curves  are  close  to  each  other  which  suggests  that  the  AV-type  algorithms  are  not  very  sensitive  with  respect 
to  the  filter  length.  Both  AV-type  algorithms  outperform  significantly  the  L-order  and  2L-order  MVDR 
algorithms  (in  the  small  data-support-size  region  the  L-order  MVDR  algorithm  achieves  the  same  CSER  as 
the  2L-order  MVDR  algorithm  approximately  L  samples  faster). 

To  quantify  the  (in-) sensitivity  of  the  proposed  receivers  to  synchronization  errors  we  compare  the  BER 
performance  of  the  AV  and  the  SMI-MVDR  receiver  to  the  performance  of  their  perfectly  synchronized 
counterparts.  Fig.  8.4  depicts  the  performance  of  the  proposed  receivers  as  a  function  of  the  data  record 
size  for  receivers  of  order  L  and  2 L.  The  user  SNR  setup  in  Fig.  8.4  is  identical  to  the  setup  in  Fig.  8.3. 
We  observe  that  the  proposed  receivers  perform  close  to  the  perfect  synchronization  performance  levels. 
As  in  Fig.  8.3,  the  best  performance  is  attained  by  the  AV-type  schemes  while  the  L-order  SMI-MVDR 
outperforms  the  2L-order  SMI-MVDR  over  the  whole  data-record-size  range. 

In  Fig.  8.5  we  show  the  BER  performance  of  the  L  and  2L-order  diagonally  loaded  SMI-MVDR  receivers. 
The  2L-order  SMI-MVDR  receiver  utilizes  a  sample-average  autocorrelation  matrix  that  is  diagonally  loaded 
by  the  amount  atrace(R)I,  where  a  =  10-4  as  suggested  in  [42].  For  a  fair  comparison,  the  same  diagonal 
loading  value  is  used  in  the  implementation  of  the  L-order  algorithm.  We  understand,  however,  that  any 
algorithm  that  utilizes  diagonal  loading  is  sensitive  to  the  specific  choice  of  the  real- valued  diagonal-loading 
parameter.  The  results  in  Fig.  8.5  are  quite  similar  to  those  of  Fig.  8.4  and  the  main  benefit  is  operation 
under  sample  support  less  than  the  filter  order. 

In  Figs.  8.6  and  8.7,  the  coarse-synchronization-error-rate  performance  is  shown  as  a  function  of  the 

2The  SNR  of  user  k,  k  =  0, . . . ,  K  —  1,  is  defined  as  Ek/cr2  where  a2  is  the  variance  of  the  chip-matched  filtered 
and  sampled  AWGN. 
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Data  Record  Size 

Fig.  8.3.  Coarse-synchronization-error-rate  as  a  function  of  the  data-record-size  for  linear  receivers 
of  order- L  and  order-211. 

SNR  of  the  user  of  interest  for  receivers  of  order  L  and  2 L,  respectively.  The  SNRs  of  the  interferers  are 
the  same  as  in  Fig.  8.3.  To  illustrate  the  performance  of  the  receivers  in  short  data  record  scenarios,  we 
choose  the  total  number  of  samples  used  for  the  estimation  of  the  autocorrelation  matrix  to  be  slightly  larger 
than  the  employed  filter  order.  Thus,  for  the  L-order  algorithms  55  samples  are  used,  while  for  the  2L-order 
algorithms  85  samples  are  used.  It  is  interesting  to  note  that  the  L-order  SMI-MVDR  receiver  performs 
close  to  its  2L-order  counterpart,  even  though  the  former  utilizes  a  smaller  data  record  size.  As  expected, 
the  AV  receivers  (of  order  L  or  2 L)  offer  the  lowest  synchronization  error. 

Figs.  8.8  and  8.9  depict  the  BER  as  a  function  of  the  SNR  of  the  user  of  interest  for  receivers  of  order 
L  and  2 L,  respectively.  The  interfering  users  SNR  setup  in  Figs.  8.8  and  8.9  is  identical  to  that  in  Fig.  8.3. 
As  in  Figs. 8. 6  and  8.7,  55  samples  are  used  to  estimate  the  autocorrelation  matrix  of  the  L-order  algorithms 
and  85  samples  for  matrix  estimation  of  the  2L-order  algorithms.  The  performance  results  in  Figs.  8.8  and 
8.9  parallel  the  CSER  findings  in  Figs.  8.6  and  8.7,  while  -  as  in  Fig.  8.4  -  the  proposed  receivers  perform 
close  to  the  perfect  synchronization  performance  levels. 

8.7  Conclusions 

We  considered  the  receiver  design  problem  for  asynchronous  DS/CDMA  systems.  We  focused  on  blind 
combined  synchronization/demodulation  receiver  designs  that  lead  to  simple  structures  and  eliminate  the 
need  for  an  extra  stage  that  performs  timing  extraction  of  the  desired  transmission.  Along  these  lines,  first  we 
developed  an  MVDR-type  structure  of  order  L  with  enhanced  performance  over  higher  order  structures  of  the 
same  type.  To  overcome  the  basic  limitations  of  MVDR-type  filtering  under  short-data-record  operations, 
we  developed  an  L-order  linear  structure  based  on  auxiliary-vector  filtering  principles.  Computationally 
efficient  implementations  of  the  proposed  schemes  were  discussed  and  simulation  studies  were  performed  that 
demonstrated  the  superiority  of  the  proposed  L-order  schemes  when  the  objective  is  rapid  synchronization 
and  reliable  demodulation  with  limited  data  support. 
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Fig.  8.4.  Bit-error-rate  as  a  function  of  the  data  record  size  for  linear  receivers  of  order- L  and 
order- 2L  (the  BER  performance  of  their  perfectly  synchronized  counterparts  is  also  included  for 
reference). 


8.8  Appendix 


Proof  of  Proposition  8.1 

In  a  power  controlled  system,  Ei  =  E,  i  =  0,  * . . ,  K  —  1.  If  w ^tMVDR  =  ,r  ,;._s1°  t ,  l  =  0,  —  ,  L  —  1,  then  the 

output  energy  is  given  by  (sq  R(^  Sq)_1.  For  the  system  under  consideration,  the  autocorrelation  matrix 

R(')  is  given  by 

=  E’£(l)2(l)T  +  a2I  (8.51) 

where 

(8.52) 

and  the  Lx  2  matrix  Ek\  k  =  0, . . . ,  K  —  1,  is  constructed  from  the  user  signatures  as  follows: 


with 


/  (1  -  ^)[011^_0,sfc(0), . . .  ,sfc(i  -  -  l)]r+  \ 

n'k 

5fcfe^3,afc(o),''',afc(L_n#fc)^ 

(1  -  S'k)[sk(L  -n'k),...,sk(L-  1),  O^Of + 

L-n.'k 

S'k  [sk(L  -n'k  +  l),...,sk(L-  1),  0, . . . ,  0]T 

V  L-K-1  / 


J  l  -  nk  if  l  <nk  ,  w  _  f  Sk  if  l  <  nk 
\  L+  (l  -  nk )  if  l  >  nk  an  k  (1-4  if  l  >  nk 


(8.53) 


(8.54) 


114 


Data  Record  Size 

Fig.  8.5.  Bit-error-rate  as  a  function  of  the  data  record  size  for  diagonally  loaded  SMI-MVDR  re¬ 
ceivers  of  order- L  and  order- 2 L  (the  BER  performance  of  their  perfectly  synchronized  counterparts 
is  also  included  for  reference). 


Let  A^\  . . . ,  Xp1  be  the  non-zero  eigenvalues  of  and  v^\  . . . ,  Vp1  their  corresponding  orthonormal 

eigenvectors.  Then,  we  may  express  R(^  as  follows: 

R(,)  =  a2I  +  £V(,)  A(,)V(i)T  (8.55) 

where  =  diag[Xi\  . . . ,  A^]  and  V^)  =  [v^\  . . . ,  v^J,  The  inverse  of  R(^  is  given  below  [47]: 

R(')_1  =  I_  V(°  (^A('rl  +  lj  V(,)T  .  (8.56) 

Assuming  without  loss  of  generality  that  ||sg||2  =  1  we  have 

s*TR(')_1sS  =  1  -s*TV(i)  ^A(i)_1  +  lj  V(,)TsS  .  (8.57) 


The  above  expression  implies  that  minimization  of  Sq  R(d  Sq  is  equivalent  to  maximization  of  Sq  V(d(^A(d  - 
l)-1V(')Tg*.  But, 

lim  sSTV(,)(— A(')_1  +  l)“1V(')TsS  =  s*TV(i)V(,)Ts*  =  ||V(')TsS||2  (8.58) 

E— >oo  E 


and 

||V(')Ts2H2  <  HsDII2  =  1  =  ||V("o)TsS||2.  (8.59) 

Thus,  there  exists  an  E*  >  0  such  that  for  E  >  E*  argmax;  SgTRd)  '  sg  =  n0  if  and  only  if  ||Vd)Ts2||2  <  1 
for  l  ^  no-  Indeed,  the  latter  condition  is  satisfied  if  and  only  if  Sg  does  not  lie  in  the  subspace  spanned  by 
the  columns  of  for  l  hq.  □ 
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Fig.  8.6.  Coarse-synchronization-error-rate  as  a  function  of  the  SNR  of  the  user  of  interest  for  linear 
receivers  of  order- L  (sample  support  N  =  55). 


Proof  of  Proposition  8.2 

(i)  We  observe  that 


ao(^)w(j  0°MVDR 

=  (l-2<i)R(”o)  'so, 

(8.60) 

=  25R("°)_1  (0.5s0  +  0.5sq+1)). 

(8.61) 

Then, 

ao(fi)wd0°,MVDR  +  al(^)Wd1°MVDii  =  R*”0'  s||0||(^)' 

(8.62) 

So,  the  filters  w  G  W  are  of  the  form 

w  = 

R(fto)_1ioii^) 

(8.63) 

s^ii^R^O-sloii^)- 

(ii)  (8.63)  implies  that  the  maximization  in  (8.34)  over  all  5  £  [0, 1)  is  equivalent  to  maximization  of 
wtR(”°)w  over  all  w  e  W.  □ 


Proof  of  Proposition  8.3 

From  the  partition  matrix  inversion  formula  [47]  we  have 

r(i+ D-1  =  (  (C  -  ^-bl+1bf+1)  _1  (b^C-^+i.  -  A 

V  (b,';,C  -  a^r'b^ C-1  (al+1  -  b,'\  ,C  'b,H)  1  )  ’ 

while  from  the  matrix  inversion  lemma  we  have 


1 

<M+ 1 


b/+i 


-i 


=  c-1 


C~1b;+1b^1C-1 
ai+i-  bf+1C-1b;+1‘ 


(8.64) 


(8.65) 
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Fig.  8.7.  Coarse-synchronization-error-rate  as  a  function  of  the  SNR  of  the  user  of  interest  for  linear 
receivers  of  order-2L  (sample  support  N  =  85). 


Partition  R(^  as  follows: 

c)  <8-66> 

where  a;  is  a  scalar  and  b;  is  an  (L  —  1)  x  1  vector.  Then,  application  of  the  partition  matrix  inversion 
formula  and  the  matrix  inversion  lemma  leads  to  the  following  expressions  for  T,  a,  and  (3  in  (8.50): 


r  = 

p-i  .  c-^.bfc-1 
a,-  bfC-'br 

(8.67) 

a  = 

(a,  -bfC-'b,)”1, 

(8.68) 

0  = 

-'(i+  brbT‘b'Vv 

a;  V  a;  -  b /  C ~xbi  J 

(8.69) 

But,  bfC^bi  —  ai  —  a-1.  Hence,  /3  = 

— aC_1b;  which  implies 

C_1b;  =  -P. 

a 

(8.70) 

From  (8.67)  and  (8.70)  we  obtain 

C1  =  r  -  -/3/3t. 
a 

(8.71) 

□ 
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Fig.  8.8.  Bit-error-rate  as  a  function  of  the  SNR  of  the  user  of  interest  for  linear  receivers  of  order- 1, 
(the  BER  performance  of  their  perfectly  synchronized  counterparts  is  also  included  for  reference). 
The  sample  support  N  =  55. 


Fig.  8.9.  Bit-error-rate  as  a  function  of  the  SNR  of  the  user  of  interest  for  linear  receivers  of 
ordcr-2L  (the  BER  performance  of  their  perfectly  synchronized  counterparts  is  also  included  for 
reference).  The  sample  support  is  N  =  85. 
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Chapter  9 


Rapid  Combined 
Synchronization/Demodulation 
Structures  for  DS/CDMA  Systems, 
Part  II:  Finite  Data  Record 
Performance  Analysis 

9.1  Introduction 

The  effectiveness  of  a  receiver  designed  for  rapidly  changing  wireless  direct-sequence  code-division-multiple- 
access  (DS/CDMA)  communication  environments  depends  on  the  following  design  attributes:  (i)  low  com¬ 
putational  complexity,  (ii)  multiple-access-interference  (MAI)  near-far  resistance,  and  (in)  system  adaptivity 
with  superior  performance  under  limited  data  support.  Adaptive  short-data- record  designs  appear  as  the 
natural  next  step  [l]-[4]  to  a  matured  discipline  that  has  extensively  addressed  the  first  two  design  objectives 
in  ideal  setups  (perfectly  known  or  asymptotically  estimated  statistical  properties)  [5],  [6].  System  adaptiv¬ 
ity  based  on  short  data  records  is  necessary  for  the  development  of  practical  receivers  that  exhibit  superior 
bit-error-rate  performance  when  they  operate  in  rapidly  changing  communication  environments  that  limit 
substantially  the  available  data  support. 

In  Chapter  8  we  considered  self-synchronized  receivers  (integrated  synchronizers/demodulators)  and  we 
presented  three  schemes  along  the  classical  receiver  evolution  path.  We  started  with  a  matched-filter-type 
(MF)  structure,  we  continued  with  a  minimum- variance-distortionless- response-type  (MVDR)  scheme  and, 
finally,  we  developed  an  auxiliary-vector-type  (AV)  alternative,  all  of  order  either  twice  or  equal  to  the 
system  spreading  gain  L.  Simulation  studies  illustrated  the  performance  of  the  proposed  structures  in  terms 
of  bit-error-rate  (BER)  versus  data  support  for  a  given  signal-to-noise  ratio  (SNR)  of  the  user  of  interest  or 
in  terms  of  BER  versus  SNR  for  a  given  data  record  size. 

In  this  chapter,  we  develop  three  methods  for  the  analysis  of  the  finite  data  record  behavior  of  self- 
synchronized  linear  receivers  and  in  particular  the  effect  of  the  filter  order  and  the  data-record-size  on 
the  coarse-synchronization-error-rate  (CSER).  We  derive  analytic  expressions  that  approximate  closely  the 
probability  of  coarse  synchronization  error  of  MF-type  and  MVDR- type  schemes.  Probability  of  coarse 
synchronization  error  analysis  for  AV-type  schemes  based  on  a  finite  data-record-size  is  prohibitively  complex 
and  thus  not  attempted  at  this  time. 

The  chapter  is  organized  as  follows.  In  Section  9.2,  we  present  the  system  model  and  we  summarize 
briefly  the  combined  synchronization/demodulation  algorithms  developed  in  Chapter  8.  In  Section  9.3,  we 
develop  analytic  expressions  that  approximate  the  coarse  synchronization  error  probability  ( Pcse )  while  in 
Section  9.4  we  derive  a  sequence  of  increasingly  tight  lower  bounds  on  Pcse.  In  Section  9.5  we  discuss  the 
effect  of  the  filter  order  and  the  data-record-size  on  the  coarse  synchronization  error  rate.  The  accuracy  of 
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(b) 

Fig.  9.1.  Coarse  synchronization  based  on  filters  of  order  L. 


the  analytic  approximations  of  Sections  9.3  and  9.4  is  examined  through  simulations  in  Section  9.6.  Some 
final  conclusions  are  drawn  in  Section  9.7. 


9.2  System  Model  and  Algorithmic  Description 

We  consider  an  asynchronous  DS/CDMA  system  populated  by  I\  active  users  transmitting  over  a  common 
additive  white  Gaussian  noise  (AWGN)  channel.  The  received  signal  r(t)  is  the  superposition  of  the  K 
transmissions  corrupted  by  channel  noise,  i.e. 

K— 1  oo 

r(t)  =  X  V^kbk{i)Sk{t  -  iT  -  rk)  +  n(t).  (9.1) 

k= 0  i=— oo 

In  the  above  expression,  with  respect  to  the  fcth  user  Tk  is  the  signal  delay  relative  to  the  receiver’s  time 
reference  point,  bk(i)  G  {  —  1,  +1}  is  the  1th  information  bit,  Ek  is  the  transmitted  energy,  T  is  the  information 
bit  period,  and  n(f)  is  AWGN.  The  normalized  signature  waveform  5),  (t)  assigned  to  the  kth  user  has  the 
form 

L—i 

Sk(t)^J2sk(l)pTc(t-lTc),  A:  =  0, . . . ,  7\  —  1,  (9.2) 

1=0 

where  Tc  is  the  chip  duration,  Ptc(-)  is  a  rectangular  pulse  with  support  [0, Tc],  L  =  T/Tc  is  the  system 

spreading  gain,  and  Sk(l)  is  the  ?th  element  of  the  normalized  bipolar  signature  or  spreading  vector  s k  = 
[sfc(0),  Sfc(l), . . . ,  Sk(L—  1)]T  that  uniquely  identifies  the  A:th  user.  Without  loss  of  generality,  we  assume  that 
Tk  G  [0,  T),  A:  =  0, . . . ,  K  —  1.  Thus,  we  may  write  the  delay  Tk  as  a  sum  of  an  integer  multiple  of  chips  plus 
a  fraction  of  a  chip:  Tk  =  (rife  +  5k)Tc  where  G  {0, . . . ,  L  —  1}  and  5k  £  [0, 1).  After  chip-matched  filtering 
and  chip-rate  sampling,  the  continuous-time  signal  r(t)  in  (9.1)  is  converted  to  a  discrete-time  sequence 

Mn)}“=-oo  where  r(n)  =  /X+1)T°  r(t)dt. 

In  this  work  we  analyze  the  coarse  synchronization  performance  of  L  and  2L-order  algorithms  of  MF 
and  sample-matrix-inversion  (SMI)  MVDR  type.  A  brief  description  of  the  coarse  synchronization  step  of 
these  algorithms  follows  (details  can  be  found  in  Chapter  8). 

Fig.  9.1  shows  the  block  diagram  representation  of  an  L-order  coarse  synchronizer/demodulator.  The 
buffer  follows  a  chip-rate  sampler  and  groups  the  received  samples  into  L  sequences  of  vectors,  {r^}*  G 
$lL,  l  =  0,...,L  —  1,  as  depicted  in  Fig.  9.2  where 

rj°  =  [r(iL  +  l),r(iL  +  l  +  l),...,r((i  +  l)L  +  l-l)]T,  l  =  0,...,L-l.  (9.3) 
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Fig.  9.2.  Construction  of  the  sequences  r^,  l  =  0, . . . ,  L  —  1. 


In  the  case  of  2L-order  algorithms,  the  received  samples  {r(n)}n  are  grouped  into  a  sequence  of  over¬ 
lapping  vectors,  r^e  tt2L ,  obtained  by  a  sliding  window  such  that  successive  vectors  share  L  common  data 
samples: 

vi=[r(iL),r{iL+l),...,r{{i  +  2)L-l)]T .  (9.4) 

To  facilitate  our  presentation  overlined  variables  always  refer  to  2L-order  processing  to  distinguish  themselves 
from  the  corresponding  variables  used  by  L-order  algorithms.  Table  9.1  provides  the  definition  of  both  the 
ideal  and  estimated  decision  statistic  (filter  output  energy)  of  each  filter  under  consideration,  namely  SMI- 
MVDR  and  MF  of  order  L  or  2 L.  Table  9.2  provides  additional  definitions  needed  in  the  expressions  of 
Table  9.1. 

For  the  L-order  case  of  Table  9.1,  R^  =  E{ }  is  the  input  covariance  matrix  and  RF-1  = 
~k  Sfco1  is  its  sample  average  estimate  (TV  is  the  total  number  of  samples  or  data  record  size).  Sim¬ 
ilarly,  for  the  2L-order  case,  R=  E{ }  and  R  =  .  In  Table  9.2,  Sg+1-*  =  [0,  so(0),  •  •  • ,  sq(L  — 

2)]T  denotes  the  one-chip  right-shifted  zero-filled  version  of  So,  while  Sg\  l  =  0, . . . ,  L  —  1,  is  the  /-shifted 
version  of  the  zero-padded  2L-long  vector  [sj,  0, . . . ,  0]T. 

An  SMI- MVDR- type  or  MF-type  coarse  estimate  of  the  delay  tq  can  be  determined  as  follows: 


TO  = 


(Umax/2J)TC,  if  jmax  is  even 

(Limax/2J  +  0.5)TC,  if  jmax  is  odd 


where 


=  arg  max  <  Uj 


3  =  0,  • 


,2L  -  1 


and  Uj  is  obtained  from  Table  9.1  for  each  filter  respectively. 


(9.5) 

(9.6) 


9.3  Finite  Data  Record  Performance  Analysis 


The  probability  of  coarse  synchronization  error  is  defined  as  the  probability  that  the  coarse  synchronization 
algorithm  fails  to  provide  an  estimate  within  the  “pull-in”  range.  In  this  chapter,  we  adopt  the  commonly 
used  assumption  that  the  pull-in  range  is  equal  to  |TC  around  the  correct  delay  To-  However,  the  performance 
analysis  presented  in  this  section  can  be  carried  out  with  straightforward  modifications  for  any  given  pull-in 
range  value.  The  probability  of  coarse  synchronization  error  is  given  by 


Pcse  Bl-Pr 


I  TO  “TO  |  <  2Tc 


(9.7) 


where  the  coarse  estimate  of  to  is  given  by  (9.5),  (9.6).  We  recall  that  jm ax  in  (9.5),  (9.6)  is  the  index  of  the 
filter  with  the  highest  output  energy.  If  we  define  P(tq)  as  the  set  of  the  filter  indices  that  yield  a  timing 
estimate  to  within  \TC  about  To,  then  the  probability  of  coarse  synchronization  error  will  be  equal  to 

Pcse  =  1  -  Pr  [jmax  £  %(t0)]  • 
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(9.8) 


Different  choices  of  the  “pull-in”  range  can  be  accommodated  by  modifying  appropriately  the  definition  of 
T-L(to).  In  the  rest  of  this  section  we  derive  close  approximations  of  the  coarse  synchronization  performance 
of  the  four  algorithms  under  consideration. 


9.3.1  MF-type  receivers  (2L-order) 

— ( t  d'  —  — (t)  ■—  — —  — (j) 

We  recall  that  in  this  case  Uj  =  d||o||  R  d||o|  and  Uj  =  d||o||  R  d||o||>  j  =  0, 1, . . . ,  2L  —  1.  Then,  the  coarse 

f  2  L) 

synchronization  error  rate  Pt.ae  MF  of  the  2L-order  MF  algorithm  can  be  expressed  as  the  probability  that 
the  index  of  the  largest  decision  variable  Uj,  j  =  0, 1, . . . ,  2L  —  1,  is  not  contained  in  'H(to);  that  is, 


p(2i) 
r cse,MF 


1  -Pr 

arg  max  <  Uj 

3  ^ 

j  =  0, . . . ,  2L  —  1  j  G  PL{tq) 

(9.9) 

1  -Pr 

max  |  Uk  k  G  'H(to)  j  >  max  j  Uj 

j  =  o 

,...,2L-l,j  tn(T0)} 

.  (9.10) 

(2L) 

The  latter  form  of  P^se  'MF  is  particularly  easy  to  handle  as  we  will  see  in  the  sequel.  We  observe  that  the 
evaluation  of  the  joint  probability  density  function  (pdf)  of  the  random  quantities  Uj,  j  =  0, . . .  ,2L  —  1, 

requires  knowledge  of  the  pdf  of  the  random  matrix  R.  We  recall  that  R  is  the  sample  average  estimate 
of  the  autocorrelation  matrix  R  of  the  received  vectors  r,  which  are  distributed  according  to  a  mixture 
of  23A  Gaussian  distributions  and  they  are,  by  construction,  statistically  dependent.  Unfortunately,  the 
use  of  exact  closed  form  expressions  for  the  pdf  of  the  sample  covariance  matrix  that  is  formed  by  vector 
samples  drawn  from  a  Gaussian  mixture  distribution  [7]  leads  to  mathematically  intractable  expressions  for 
the  probability  of  synchronization  error.  An  additional  complicating  factor  for  the  analysis  of  the  finite 
data  support  behavior  of  the  MF-type  (and  SMI- MVDR- type)  receivers  is  the  underlying  dependence  of 
the  vector  samples.  To  that  respect,  we  make  the  following  simplifying  assumptions.  We  assume  that  the 
received  vectors  are  uncorrelated  and  identically  Gaussian  Af(0,  R)  distributed  (thus  i.i.d.)  [8], [9].  The 

latter  Gaussianity  postulation  can  be  considered  as  a  wishful  approximation  of  the  true  distribution,  while 
the  former  independence  postulation  can  be  justified  if  we  consider  “guard”  bands  of  size  L  between  the 
received  vectors  so  that  successive  input  vectors  do  not  share  common  information  bits.  The  effect  of  the 
above  assumptions  will  be  examined  in  the  simulations  section  through  comparisons  with  the  exact  Pcse- 
It  is  known  [10]  that  if  xo,  xi, . . . ,  xjy-i  £  are  i.i.d.  A7(0,  S)  vectors,  then  the  distribution  of  the 
matrix  X  =  x*x?  is 


|X|bv-p-1)/2exp{  —  ItrS_1X} 

* “  2^/27tUp-1)/4|S|a/2 HU  r  [(TV  +  1  -  *)/ 2]  ’  P  >  ’ 


(9.11) 


where  T(-)  is  the  Gamma  function.  The  distribution  in  (9.11)  is  called  Wishart  distribution  with  N  degrees 
of  freedom  and  is  denoted  by  Wp(X;  TV).  A  random  matrix  distributed  according  to  (9.11)  is  called  Wishart 

matrix.  In  our  case,  R  is  approximately  Wishart  with  distribution  W2l(R  /N;N).  If  we  define  the  matrix 


—  A  .-(0)  -(1)  — (2L  — 1) . 

D~  (dllo|h  d||0||,  -  -  - ,  d||0||  ), 


(9.12) 


then  the  random  variables  Uj,  j  =  0, . . . ,  2L  —  1,  form  the  main  diagonal  of  D  R  D-  For  a  full  rank  matrix 

_  _ y  ^  _  _ y _  /s. 

D,  D  R  D  is  Wishart  W2l{jj  D  RD  ;TV)  [11]  and  the  main  diagonal  elements  Uj,  j  =  0,  ...,2L  —  1, 
are  jointly  distributed  according  to  a  multivariate  Gamma  distribution  [12].  However,  D  is  not  full  rank  in 
general  and  even  if  it  were,  it  would  have  been  of  little  help  since  closed  form  expressions  of  the  multivariate 

Gamma  pdf  do  not  exist  except  for  some  special  cases  [13].  Thus,  appropriate  approximations  of  the  joint 

_ t  a  _ 

pdf  of  the  diagonal  elements  of  D  R  D  with  tractable  multivariate  integrals  are  needed. 

Motivated  by  the  transformation  noise  modeling  techniques  used  in  [14],  we  approximate  the  joint  pdf 
of  the  MF  decision  variables  Uj,  j  =  0, . . . ,  2L  —  1,  by  the  joint  pdf  of  a  nonlinear  transformation  of 
Gaussian  random  variables  Yj,  j  =  0, . . .  ,2L  —  1.  The  nonlinear  transformation  and  the  covariance  of  Yj, 
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j  =  0, . . . ,  2L—  1,  are  chosen  such  that  the  outputs  of  the  nonlinear  transformations  have  identical  covariances 
with  the  corresponding  Uj,  j  =  0, . . . ,  2L  —  1,  and  marginal  pdfs  that  are  close  approximations  to  the  pdfs 
of  Uj,  j  =  0, . . . ,  2L  —  1.  The  following  lemma  identifies  this  non-linear  transformation  and  the  statistics  of 
the  random  variables  Yj,  j  =  0, . . . ,  2L  —  1.  The  proof  is  included  in  the  Appendix. 

Lemma  9.1  The  pdf  of  Uj  can  be  approximated  by  the  pdf  of  Uj  Y? ,  j  =  0, ...  ,2L  —  1,  where  the  random 
variables  Yj,  j  =  0, . . . ,  2L  —  1,  are  uncorrelated  Gaussian  with  mean  1  —  and  variance  .  □ 


The  probability  of  coarse  synchronization  error  can  now  be  evaluated  as  follows.  Let  the  random  variables 


Zj ,  j  =  0, . . . ,  2L  —  1  be  defined  as 


QN  i  /o 

Z<=^—U)/ZYj,  j  =  0, . . .  ,2L  —  1. 


9N-2  j  3' 


(9.13) 


It  is  straightforward  to  show  that: 


Pr 


{Uk  |  A:  E  'H(ro)}  >  max  |  Uj 


j  —  Q, ...  ,2L  —  1 


,j  <£  'H(ro)} 


Pr  [max  {  UkYk  |  k  E  'H(to)}  >  max  {  UjYf  \  j  =  0, . . . ,  2L  -  1,  j  £  = 

Pr  [max  {  Zk  \  k  E  >  max  {Zj  \  j  =  0, . . .  ,2L  -  1,  j  = 

Pr  [max  {  Zj  \  j  =  0, . . . ,  2L  -  1,  j  £  'H(t0)}  -  max  {  Zk  \  k  E  'H(ro)}  <  0] . 


(9.14) 


The  random  variables  Zj,  j  =  0, . . . ,  2L  —  1,  are  uncorrelated  Gaussian  (thus,  independent)  with  mean 
Pj  =  Uj^3  and  variance  <r?  =  U^3.  The  pdf  of  Zj,  j  =  0, . . . ,  2L  —  1,  is 


fj(z)  =  —9 


Pj 


(9.15) 


where  g{z)  =  exp(— z2/2)  is  the  pdf  of  a  Gaussian  random  variable  with  mean  0  and  variance  1,  while 
the  cumulative  distribution  function  (cdf)  of  Zj  is 


Tj(z)  =  Q 


Pj 


(9.16) 


where  G(z)  is  the  cdf  of  a  Gaussian  random  variable  with  mean  0  and  variance  1.  The  cdf  of  max{Zj  \  j  = 
0, . . . ,  2L  -  1,  j  (f  n{T0)}  is  given  by  [16] 


2L-1 

•F max.X'Hfro)  (-^)  n  Fj(z), 

j=0,jgH(TO) 


(9.17) 


where  the  index  indicates  that  the  maximum  is  taken  over  a  set  that  does  not  include  the  elements 

of  'H(ro).  Therefore,  the  corresponding  pdf  is  given  by 


2L—1 


2L-1 


/max,\«(To)W  =  /*(*)  II 

Similarly,  the  cdf  and  pdf  of  ma x{Zk  \  k  E  TL(tq)}  are  given  by 

•Ama T j  (z) 
je'HUo) 


and 


/max.W(r0)(z)  =  ^  fl(z)  [Q  Fj(z)  ]  , 

le'HUo)  \  je'H(T0),j^i 


(9.18) 


(9.19) 


(9.20) 
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respectively.  Since  Zj,  j  =  0, . . . , 2L  —  1,  are  independent,  the  two  random  variables  ma x{Zj  j  =  0, ...  ,2 L  — 
1,  j  i  'H(to)}  and  maxlZk  I  k  G  are  also  independent,  therefore  the  pdf  of  their  difference  max{  Z.,  I  j  = 

0,  •  ,2 L  -  l,j  *  H(r o)}  -  max{k  1 1  G  tt(r0)}  is  given  by 


/max,  \'H(r0)(~:  2/)/max,"H(r0)  (  V)dV- 


(9.21) 


We  conclude  that  the  probability  of  coarse  synchronization  error  for  the  2L-long  MF-type  coarse  synchroni¬ 
zation  algorithm  is 


p(2i) 

rcse,MF 


1  - 


1  - 


[  fdidz)dz 

'  — oo 

f»0  /*+oo 


/U  /»- l-oo 

I  /max,  \H(to)(Z  l/)/max,'H(ro)  (  V ) djjdz 
-oo  J  —  oo 


0  /*+oo 


1  - 


'  — OO  */  — oo 


2L—1 

/  2L  — 1 

M 

E 

Mz~y )  II  TAz~y) 

1= 

\  3=Q,3:£1,3&'U(to) 

f  \  “ 

/J 

£  ( 

M-y)  II  Fji-y)] 

dy  dz. 

jenu 0)  \ 

(9.22) 

(9.23) 


(9.24) 


Expression  (9.24)  approximates  the  probability  of  coarse  synchronization  error  as  a  double  integral. 


9.3.2  MF-type  receivers  (L-order) 

In  the  case  of  the  MF-type  coarse  synchronization  algorithm  that  utilizes  filters  of  order  L  the  probability 
of  coarse  synchronization  error  Mf  can  evaluated  by  expression  (9.24)  where  fj(-)  and  F)  (•)  denote 
the  pdf  and  cdf,  respectively,  of  Zj,  j  =  0, 1, ...,  2L  —  1  ( Zj  is  defined  as  in  eq.  (9.13)  with  the  difference  that 

Uj  is  now  given  by  Uj  =  dj'bj  R(  Lj /2J ) dj'^  ,  j  =  0, 1, . . . ,  2L  —  1).  The  proof  follows  similar  reasoning  as  in 
Section  9. 3. A  with  the  difference  that  Uj  =  djjb|  R< Lj/2J )djb^ ,  j  =  0, 1, . . . ,  2L  —  1  and  Rdl/2J)  is  Wishart 
Wi,(^R(lt/2J);  TV)  with  TV  degrees  of  freedom. 


9.3.3  SMI-MVDR-type  receivers  (2L-order) 


The  MVDR-type  SMI-implemented  coarse  synchronization  algorithm  that  utilizes  filters  of  order  2 L  can  be 

_ i  —(j)  N  _1 

analyzed  in  a  similar  manner.  In  this  case,  Uj  is  given  by  Uj  =  I  d||o||  R  d||o|| 

If  we  define  the  random  variables  Zj  ==  Uj^Yj,  j  =  0, 1, . . . ,  2L  —  1,  where  the  random  variables 


,  j  =  0, 1, . . . ,  2L  —  1. 


Yj,  j  =  0, 1, . . . ,  2L  —  1,  are  uncorrelated  Gaussian  with  mean  1  — 


9(N—2L) 


and  variance 


9(N  —  2L) 


,  then 


1/3 

Zj,  j  =  0, 1,...,2L  —  1  are  uncorrelated  Gaussian  random  variables  with  mean  jij  =  Uj  and  variance 
<7?  =  Uj  ^  [9(jv*-2L)-2]2  •  "bhe  probability  of  coarse  synchronization  error  P^^mvdr  Is  given  by  expression 
(9.24)  where  Tj{-)  and  fj(-)  denote  the  cdf  and  pdf  of  the  random  variable  Zj,  j  =  0, 1, . . . ,  2 L  —  1.  The 

~  — _1  ~(j) 

proof  follows  again  similar  reasoning  as  in  Section  9. 3. A  with  the  difference  that  Uj  =  I  d||o|  R-  d||o| 

j  =  0, 1, . . . ,  2L  —  1  and  R  is  Wishart  W2l(jj  R;  N)  with  TV  degrees  of  freedom.  The  latter  implies  that  Uj 
can  be  expressed  as  a  multiple  of  a  y2  random  variable  with  N  —  2L  degrees  of  freedom. 


9.3.4  SMI-MVDR-type  receivers  (L-order) 

For  the  MVDR-type  SMI-implemented  coarse  synchronization  algorithm  that  utilizes  filters  of  order  L  we 
have  Uj  =  (dj,J2  R(Li/2J)  dj,-7/')  ,  j  =  0,1, ...  ,2L  —  1.  Then,  the  probabilty  of  coarse  synchronization 
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error  P^e  mvdr  is  stdl  given  by  expression  (9.24)  where  Tj  (■)  and  fj(-)  denote  the  cdf  and  pdf  of  the 
random  variable  Zj  =  j  =  0, . . . ,  2L  —  1.  The  random  variables  Yj,  j  =  0, 1, . . . ,  2L  —  1, 

are  uncorrelated  Gaussian  with  mean  1  —  Q^_L^  and  variance  therefore,  the  random  variables 

Zj,  j  =  0, 1,...,2L  —  1  are  uncorrelated  Gaussian  random  variables  with  mean  fij  =  Uj  and  variance 
<7?  =  U j^3  [g(N-L) -2] 2  •  The  Pro°f  follows  similar  reasoning  as  in  Section  9. 3.  A  with  the  only  difference  that 

Uj  is  now  given  by  Uj  =  Rdi/2J)  1dp|^  and  R( Lj/2J )  1  js  Wishart  Wl(^ R(Li/2J)  1-,N)  with  N 

degrees  of  freedom.  The  latter  implies  that  Uj  can  be  expressed  as  a  multiple  of  a  x 2  random  variable  with 
N  —  L  degrees  of  freedom. 


9.4  A  Recursive  Method  for  the  Approximation  of  the  Probability 
of  Coarse  Synchronization  Error 

In  this  section,  we  develop  a  recursive  procedure  for  the  evaluation  of  the  probability  of  coarse  synchro¬ 
nization  error,  Pcse.  The  method  evaluates  Pcse  as  a  sequence  of  increasingly  tight  lower  bounds  on  the 
approximate  expression  (9.24)  and  is  computationally  simpler  than  the  latter  expression  that  evaluates  Pcse 
as  a  convolution  of  pdfs.  We  recall  that  (9.24)  with  appropriate  definitions  of  fj  (•)  and  Fj(-)  can  be  used 
for  the  evaluation  of  P^MF,  Pcse,MVDiv  and  Pcse,MVDR  as  discussed  in  Section  9.3. 

We  begin  with  the  observation  that  the  cardinality  of  the  set  "H(r o)  is  almost  always1  equal  to  2.  Thus, 
without  loss  of  generality  we  assume  that  'H(to)  =  {0, 1}.  In  addition,  we  assume  that  the  random  variables 
Zj,  j  =  2,  ...,2L  —  1  are  ordered  in  terms  of  decreasing  variance2,  i.e.  Var{Zj}  >  Var{Zj+ 1}  for  j  = 
2, . . . ,  2L  —  2.  Let 


p(.k) 
J  cse 


A 


1  -  Pr 


arg  max{Zj 
j 


j  =  o, i, . . . , + 1} )  en(T0) 


(9.25) 


=  1  -  Pr[max{Z0,  Zx}  >  max{Z2,  Z3,...,  Zk+2}\ 


(9.26) 


where  k  =  0, . . . ,  2L  —  3.  It  is  straightforward  to  check  that  for  any  A:  =  0, . . . ,  2L  —  4,  we  have  Pcs}  <  p}s+^ ■ 
Indeed,  including  the  extra  element  Zk+ 3  to  {Z2, . . . ,  Zk+ 2}  can  only  decrease  the  probability  that  the  index 
of  the  maximum  element  of  {Z2,  ■  ■  ■ ,  Zk+2,  A+3}  will  be  larger  than  max{Z9,  Zi}.  Sorting  the  elements 
by  decreasing  variance  implies  that  the  k  elements  with  the  highest  variance  (and  mean)  will  produce  the 
tightest  lower  bound  PP}  of  all  other  choices  of  k  elements  from  {Z2, . . . ,  Zk+ 2}-  For  k  =  2L  —  3,  all  variables 
Zj,j  =  0,...,2L  —  l,  are  included  in  the  evaluation  of  the  fcth  bound,  so 


p}se  3)  =  1  -  Pr[max{Z0,  Z3}  >  max{Z1,Z2, ...,  Z2L_  1}]. 


(9.27) 


Thus  (cf.  (9.14)), 

pL°}  <  p}ll  <■■■<  p}se~3)  =  Pr  [max  {  Zj  \  j  =  0, . ,  2L  -  l,j  £  n(r0)}  -  max  {  Zk  \  k  e  ^(r0)}  <  0] . 

(9.28) 

The  probability  bounds  PP} ,  k  =  0, . . . ,  2L  —  3,  can  be  evaluated  as  follows.  The  pdf  of  max{Zo;  A}  is 


/max{z0,Zi}(z)  =  fo(z)Fi(z)  +  fi(z)F0(z),  (9.29) 

while  the  pdf  of  max{Z2, . . . ,  Zk+ 2}  is 

k+ 2  /  k+ 2  \ 

/ma  x{Za ^lW=Eb'W  II  ■  (9-30) 

1=2  \  j=2,j^l  j 

1The  cardinality  is  1  if  and  only  if  to  =  nTc  for  some  n  =  0, . . . ,  L  —  1.  However,  the  probability  of  these  events 
is  zero. 

2  Sorting  the  variables  Zj  in  terms  of  decreasing  variance  is  equivalent  to  sorting  them  in  terms  of  decreasing  mean. 
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Then,  the  probability  Pcs, 1  is  equal  to 

/»0  /*+oo 


p(k )  —  \  _ 


/u  n-\-oo 

/  /max{Zo,Zi}(^  —  ]j)  fmax{Z2,...,Zk+2}  (—  y)dydz. 
-oo  J  — oo 


Alternatively,  a  simpler  recursive  procedure  for  the  evaluation  of  the  probability  bounds  Pcse ,  k  = 

(A;) 

0, . . . ,  2L  —  3,  is  proposed  below.  The  probability  PCSe,  k  =  0, . . . ,  2L  —  3,  is  equal  to 

p£l  =  ^  -  ^i°ax}  <  o]  (9.32) 


where 

and 


Z(nkl^max{Z2,...,Zk+2j 


Z^  ^max{Z0,Z,j. 

Assuming  that  ^max  and  ^max^  are  normally  distributed  we  may  calculate  Pcse  as  follows 


e{z£L}-e{z& L’x}} 

Var  {z£L}  +  Var  {zl°^} } 


p£)  -1  -Q 


The  mean  and  variance  of  Zmax  can  be  evaluated  recursively  as  described  by  the  following  lemma. 
Lemma  9.2  For  k  =  1, . . . ,  2L  —  3, 


{^L}  =  e{z£-p}q 


y{k— 1)  1 
Amax  ( 

—  E  {Zk+ 2} 

Var 

f  y(k—l) 
Amax 

|  +  Var  {Zk+2} 

+E{Zk+2}g  — 


E{ 

y{k  1)  1 
Amax  r 

—  E  {Zk+ 2} 

Var 

f  y{k~  1) 

S  ^ma x 

|  +  Var  {Zk+2} 

Ivor  {zgj'1}  +Var{Zk+2}  •  g 


E{ 

y{k  1)  'l 
^max  r 

—  E  {Zk+ 2} 

^ jvar 

/  y{k  —  1) 
"max 

|  +  Var  {Zk+ 2} 

and  (9.36) 


{z£L}  =  Var{z£~P}g 


y{k— 1)  1 
"max  r 

—  E  {Zk+ 2} 

^Var 

f  — 

"max 

|  +  Var  {Zk+2\ 

+Var{Zk+2}g  - 


E{ 

y(k  —  l)  1 

"max  f 

—  E  {Zk+ 2} 

Var 

f  yi.k  —  1) 

\  "max 

|  +Var  {Zk+ 2} 

(E{z£^}-E{zk+2\)g 


y(k— 1)  1 

"max  r 

—  E  {Zk+ 2} 

^ jvar 

/  y(k— 1) 
"max 

|  +  Var{Zfc+2} 

I  Var  {z£~P  }  +  Var  {Zk+2}  ■  g 


y(k— 1)  1 

"max  r 

—  E  {Zk+ 2} 

^ jvar 

/  y(k— 1) 
"max 

|  +  Var  {Zk+2} 
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(E{z£-v)-E{zk+2})g 


/ 

y(k  —  l)  1 
Amax  r 

—  E  {Zk+2} 

V  \ 

jvar 

(  y(k~  1) 
\  Amax 

|  +  Var  {Zk+2} ^ 

yj Var  {z£ ax1 1 }  +  Var  {Zk+2} 


( 

E{ 

y(k  l)'! 
Amax  r 

-E{Zk+2}  \ 

\\ 

jvar 

f  1) 

\  A  max 

|  +  Var  {Zk+2} y 

(9.37) 


with  E  {zi°L}  =  E{Z2}  and  Var  {z£2*}  =  Var{Z2}. 

Proof.  Since 

Z^l  =  max{Z2,...,Zk+2}  =  ma  x{Z^\Zk+2}  (9.38) 

(9.36)  and  (9.37)  provide,  respectively,  the  mean  and  variance  of  the  maximum  of  two  uncorrelated  Gaussian 
random  variables  [17].  □ 

The  mean  and  variance  of  can  be  evaluated  in  exactly  the  same  way.  As  a  final  comment,  we  recall 

that  the  expression  (9.24)  of  Section  9.3  was  derived  based  on  the  assumption  that  the  pdf  of  the  received 
vectors  can  be  closely  approximated  by  the  pdf  of  i.i.d.  Gaussian  random  vectors.  The  recursive  method  for 
the  evaluation  of  Pcse  developed  in  this  section  is  based  on  the  additional  assumption  that  the  maximum  of 
two  independent  Gaussian  random  variables  is  approximately  Gaussian.  The  effect  of  both  assumptions  is 
examined  in  the  simulations  section  through  comparisons  with  the  exact  Pcse- 


9.5  Discussion 

Summarizing  the  developments  so  far,  the  probability  of  coarse  synchronization  error  was  approximated  as 
follows: 

Pcse  —  1  -  Pr[argmax{  Zj  \  j  =  0, . . . ,  2L  —  1}  e  77(r0)]  (9.39) 

j 

where  T-L(tq)  is  the  set  that  contains  the  indices  of  the  filters  that  yield  a  timing  estimate  within  the  pull-in 
range.  The  random  variables  Zq,  . . . ,  Z2l_i  are  uncorrelated  Gaussian  random  variables  with  means  and 
variances  given  by 

E{Zj}  =  U]'\  Var{Zj}  =  C(N)U-/3  (9.40) 

where  the  quantities  Uj,  j  =  0, ...,  2L  —  1,  and  the  functions  C(N)  are  presented  in  Table  9.1  and  9.3, 
respectively,  for  each  filter  under  consideration  (SMI-MVDR  or  MF  of  order  L  or  2 L). 

Thus,  the  probability  of  coarse  synchronization  error  was  calculated  as  the  probability  that  the  index  of 
the  largest  Zj,  j  =  0, . . .  ,2L  —  1,  is  not  contained  in  H(tq).  This  approach  led  to  the  Pcse  expressions  for 
the  order- L  and  order-2L  MF-type  or  SMI-MVDR- type  receivers  presented  in  Section  9.3,  as  well  as  to  the 
evaluation  of  Pcse  through  the  sequence  of  lower  bounds  presented  in  Section  9.4.  An  intuitively  simpler  but 
computationally  more  complex  method  for  the  calculation  of  Pcse  is  presented  in  the  Appendix.  Expression 
(9.40)  and  Table  9.3  imply  that  the  mean  and  variance  of  Zj,  j  =  0,  ...,2L  —  1,  and,  consequently,  the 
performance  of  the  SMI-MVDR-type  algorithms  depend  explicitly  on  the  filter  order.  Moreover,  the  data 
record  size  N  and  the  filter  order  appear  only  in  the  terms  (N—2L)  and  ( N—L )  of  C(N)  for  the  filters  of  order 
2 L  and  L,  respectively.  Although  the  filter  order  does  not  affect  the  asymptotic  (as  N  — »  oo)  performance  of 
the  SMI-MVDR-type  algorithms,  it  does  cause  a  performance  drop  in  the  small  data-record-size  operating 
region  since  the  filter  order  term  is  subtracted  from  the  data-record-size  term.  In  other  words,  the  data  record 
size  is  “effectively”  reduced  by  a  number  of  data  vector  samples  equal  to  the  filter  order.  The  longer  the 
employed  filter  is,  the  greater  the  “effective”  data-record-size  reduction  is.  For  the  SMI-MVDR-type  coarse 
synchronization  algorithms,  this  translates  to  poorer  performance  under  short-data-support  conditions  as 
the  filter  order  increases. 

An  alternative  way  to  quantify  the  effect  of  the  data-record-size  on  the  probability  of  coarse  synchronizat¬ 
ion  error  is  to  examine  the  short-data-record  synchronization  resolution  of  the  SMI-MVDR-type  algorithms 
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Table  9.3.  Definition  of  C(N ) 


L-oi'dcr 

SMI-MYDR 

MF 

C(AT)  —  l8(N~L) 

C(]\T)  —  18  N 

—  [9(N—L)—2]2 

)  ~  (9N—2)2 

2L-ordcr 

SMI-MVDR 

MF 

(~<(  M\  —  18(N—2L) 

r<(  at)  —  18 N 

[9(AT-2L)-2]2 

“  (9iV — 2)2 

that  we  define  as  follows.  For  the  pair  of  decision  variables  ( Zi,Zm ),  l  =  0, . . . ,  2L  —  1,  m  =  0, . . . ,  2L  —  1, 
with  E{Zi}  7^  E{Zm},  the  resolution  metric  is  defined  by 


A  f  Pr[Zt  >  Zm],  if  E{Zt}  >  E{Zm} 
\  Pr[Zi  <  Zm\,  otherwise. 


(9.41) 


That  is,  the  resolution  m  is  the  probability  that  the  coarse  synchronization  algorithm  will  decide  in  favor 
of  the  asymptotically  largest  decision  variable.  In  other  words,  the  synchronization  resolution  is  a  measure  of 
how  close  the  performance  of  the  finite  data  record  SMI-based  algorithm  is  to  the  asymptotic  performance. 
It  is  straightforward  to  show  that  the  resolution  of  the  SMI-MVDR  algorithms  is  given  by 


Pl,m 


Q 


17, 1/3  -  17, 


1/3 


17, 


2/3 


(9.42) 


where  Q (x)  is  the  cdf  of  a  normal  random  variable  with  mean  0  and  variance  1.  The  following  proposition 
compares  the  resolution  performance  of  the  L-order  and  2L-order  algorithms  for  a  given  pair  of  hypotheses. 
The  proof  is  straightforward  and  thus  omitted. 


Proposition  9.1  For  a  given  pair  of  hypotheses  (1,  m),  l  =  0, . . . ,  2L  —  1,  m  =  0, . . . ,  2L  —  1,  we  have 


if  and  only  if 


where 


(L)  (2  L) 

Pl,m  >  PLm 


9  (TV  -2L)-2  y/N-L  ^  c, 

9 (N  -  L)  -  2  y/N-  2 L  ~  l’m 


(d|10|j  R  d||0||)  (d|lo|l  R _ dl|Q||l 

(4Vr"14i)-1/3  -  (dSrR"ldS)-1/3 

y / (djoil  Rodion)"173  +  (d||0|j>rR~1d{|0||42^3 

VKoii  : R(L'/2j)“ld[,o)ii)-2/3  -  (dSTR(Lm/2J)"ldS)_2/3' 


(9.43) 

(9.44) 


(9.45) 

□ 


132 


Let  Smin  =  min{S/m  |  l,m  =  0,  ...,2  L  —  1}.  Then,  the  resolution  performance  of  the  L-order  algorithm 
will  be  better  than  the  performance  of  the  2L-order  algorithm  (in  the  sense  that  the  L-order  algorithm 
performance  will  be  closer  to  the  corresponding  asymptotic  performance)  for  any  pair  of  hypotheses,  provided 
that 

-  2L)  -  2  VN^L 

-  L)  -  2  y/N  —  2L  ~  ’ 

(9.46)  identifies  a  condition  on  the  system  parameters  and  the  data  record  size  under  which  SMI-MVDR-type 
L-order  algorithms  outperform  the  2L-order  algorithms. 

On  the  other  hand,  in  the  case  of  MF-type  algorithms  the  mean  and  variance  of  Zj  do  not  depend 
explicitly  on  the  filter  order.  Theoretically,  this  conclusion  is  a  direct  consequence  of  the  fact  that  for  any 
Wishart  WP(E-,N)  matrix  X  and  any  non-zero  vector  v  E  5RP,  the  quantity  vTXv  is  a  x2  random  variable 
that  maintains  TV  degrees  of  freedom  (while  (vTX-1v)-1  is  y2  with  TV  —  p  degrees  of  freedom).  Therefore, 
algorithms  that  do  not  require  the  inversion  of  a  Wishart  matrix  do  not  suffer  from  “effective”  data-record- 
size  reduction.  Consequently,  it  is  expected  that  the  MF-type  and  AV-type  coarse  synchronization  algorithms 
are  fairly  insensitive  to  the  choice  of  filter  length. 

As  a  final  note,  we  recall  that  an  alternative  to  direct  inversion  of  R(^  is  the  use  of  a  matrix- inversion- 
lemma  based  RLS  algorithm  [15].  In  reality,  the  RLS  algorithm  computes  the  inverse  of  a  diagonally  loaded 
version  of  R(,\  (R(^  +  ^I)_1  where  (-1  is  the  initialization  matrix  of  RLS.  In  fact,  we  can  show  that 
although  RLS  provides  an  asymptotically  (TV  — »  oo)  unbiased  estimate  of  (R(^)-1,  for  a  finite  data  record 
size  TV  it  provides  a  biased  estimate  of  (R(^)_1  with  a  bias  that  depends  on  e/TV.  Since  in  practice  e  is  small, 
the  effect  of  the  diagonal  loading  quickly  becomes  negligible  and,  consequently,  the  performance  is  close  to 
the  SMI  performance  analyzed  in  this  work. 


9.6  Simulation  Results  and  Comparisons 

We  consider  a  10-user  asynchronous  DS-CDMA  system  that  utilizes  Gold  sequences  of  length  L  =  31.  We 
compare  the  coarse-synchronization-error-rate  evaluated  using  the  expressions  derived  in  Sections  III  and 
IV  with  the  exact  coarse-synchronization-error-rate  of  MF  and  SMI-MVDR-type  algorithms.  The  exact 
coarse-synchronization-error-rate  is  evaluated  by  averaging  over  10,000  independent  runs.  The  delays  of  all 
users  are  chosen  randomly  and  kept  constant  for  the  duration  of  the  experiment. 

In  Figs.  9.3,  9.4,  the  SNR3  of  the  user  of  interest  is  lOdB  and  the  data  support  size  ranges  from  35  to 
80  samples.  The  SNRs  of  the  interferers  are  fixed  at  6,  8,  10,  12,  15,  16,  18,  20,  and  22dB. 

(k) 

In  Fig.  9.3,  we  plot  as  a  function  of  k  the  sequence  of  bounds  Pcse  developed  in  Section  9.4  for  the 
order-L  and  order-2L  SMI-MVDR-type  algorithms  (given  by  (9.26)  and  (9.35)-(9.37)). 

In  Fig.  9.4,  we  compare  the  exact  coarse  synchronization  performance  of  order- L  and  order-2L  algorithms 
to  the  derived  approximations  of  Section  9.3  and  9.4,  that  is  the  direct  approximation  through  (9.24)  and 
the  recursive  approximation  through  (9.26),  (9.35)-(9.37).  Regardless  of  the  filter  order,  we  see  that  both 
approximations  offer  close  coarse  synchronization  performance  estimates.  We  observe  that  the  SMI-MVDR 
L-order  algorithm  exhibits  improved  performance  when  compared  to  the  SMI-MVDR  2L-order  scheme  for 
data  record  sizes  above  L  =  31.  In  fact,  in  the  small  data  support  region  the  L-order  algorithm  can  achieve 
the  same  CSER  as  the  2L-order  algorithm  about  L  samples  faster. 

In  Figs.  9.5  and  9.6  we  repeat  the  studies  of  Figs.  9.3  and  9.4  for  different  values  of  the  SNR  of  the  user 
of  interest.  The  data  support  for  the  estimation  of  the  order- L  and  order-2L  SMI-MVDR  filters  is  TV  =  55 
and  80  samples,  respectively.  The  results  parallel  the  findings  in  Figs.  9.3  and  9.4. 

The  insensitivity  of  the  non-inversion-based  coarse  synchronization  algorithms  to  the  filter  order  is  il¬ 
lustrated  by  the  simulation  study  of  Fig.  9.7  where  the  probability  of  coarse  synchronization  error  of  the 
L-order  algorithms  is  plotted  as  a  function  of  the  system  processing  gain  L  (the  figure  also  includes  the 
AV-type  coarse  synchronization  scheme  presented  in  Chapter  8).  The  user  SNR  values  are  identical  to  those 
in  Fig.  9.3  and  the  data  support  for  filter  estimation  is  TV  =  125  samples.  The  signature  vectors  of  the  users 
are  constructed  by  concatenating  Gold  codes  of  length  31.  We  observe  that  the  SMI-MVDR  algorithm  is 
severely  affected  by  the  value  of  the  spreading  gain  (filter  length)  as  L  approaches  TV,  while  the  MF-type 

3The  SNR  of  user  k,  k  =  0, . . . ,  K  —  1,  is  defined  as  Ek/a2  where  a2  is  the  variance  of  the  chip-matched  filtered 
and  sampled  AWGN. 
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Fig.  9.3.  Recursive  approximation  of  the  coarse-synchronization-error-rate  of  SMI-MVDR-type 
receivers  of  order- L  and  order-21,  given  by  (9.35)  -  (9.37)  ( SNRq  =  10 dB). 

algorithm  is  not  affected  (but  has  unacceptable  synchronization  performance  as  also  seen  in  Figs.  9.4  and 
9.6).  The  AV-type  synchronizer  does  not  suffer  from  significant  performance  degradation  as  the  system 
spreading  gain  L  increases  and  offers  by  far  the  most  reliable  synchronization. 

9.7  Conclusions 

In  this  chapter,  we  investigated  the  finite  data  record  coarse  synchronization  performance  of  blind  adaptive 
combined  synchronizers/demodulators.  Using  transformation  noise  modeling  techniques  we  derived  analytic 
expressions  that  approximate  closely  the  probability  of  coarse  synchronization  error.  Three  alternative 
methods  were  developed.  The  first  method  approximates  the  probability  of  coarse  synchronization  error  as  a 
double  integral.  The  second  approximation  method,  which  is  the  least  computationally  complex,  provides  a 
sequence  of  increasingly  tight  lower  bounds  on  the  approximate  expressions  of  the  first  method.  Finally,  the 
third  and  most  intuitive  method  approximates  the  probability  of  coarse  synchronization  error  as  a  2L-order 
integral  (the  expression  is  given  in  the  Appendix).  The  analytic  expressions  provide  simple,  highly- accurate 
alternatives  to  computationally  demanding  performance  evaluation  through  simulations.  We  showed  that 
the  coarse  synchronization  performance  of  the  MF-type  receiver  is  a  function  of  the  data  record  size  while 
the  performance  of  MVDR-type  SMI  or  RLS-based  filter  estimators  is  a  function  of  the  difference  between 
the  data  record  size  and  the  employed  filter  order.  The  latter  translates  to  an  “effective”  reduction  of  the 
data  record  size  with  more  evident  negative  effect  on  the  performance  as  the  available  data  record  becomes 
smaller.  MF-type  and  AV-type  (which  provide  active  interference  suppression)  filter  estimators  do  not  suffer 
such  data  reduction  and  thus  provide  an  attractive  solution  when  the  environment  dictates  short  data  record 
synchronization. 


134 


Order-L 


Fig.  9.4.  Coarse-synchronization-error-rate  of  linear  receivers  of  order  L  and  2 L  as  a  function  of 
the  data  record  size  N  ( SNRq  =  10 dB). 


9.8  Appendix 

Proof  of  Lemma  9.1 

The  marginal  pdf  of  Uj,  j  =  0, . . . ,  2L  —  1,  is 


=  xN/2  1exp{-u/Uj} 
A  (i^i)N/2rW2) 


(9.47) 


Expression  (9.47)  implies  that  the  random  variable  Uj,  j  =  0,  — ,  2L  —  1,  is  a  multiple  of  a  chi-square  (y2) 
random  variable.  More  precisely,  Uj  can  be  written  as 


Uj  =  {^TUj)Xj,  j  =?'(), ....  2/.  1. 


(9.48) 


where  Xj  is  a  y2  random  variable  with  N  degrees  of  freedom.  Since  ( Xj/N )1/3  is  approximately  Gaussian 
with  mean  (l  —  <^)  and  variance  [18],  we  may  approximate  the  pdf  of  Uj  by  the  pdf  of 


Vi 


o  =  N{-Uj)Yf  =  Uj  Yj  , 


3  =  0,  • 


,2L  -  1. 


(9.49) 


In  the  above  equation,  Yj  is  a  Gaussian  variable  with  mean  (l  —  4^)  and  variance  ^ .  The  joint  pdf  of 
Yq,  . . . ,  Y2L-1,  will  be  completely  defined  if  we  specify  their  covariance  matrix  that  we  denote  by  4'.  The  diag¬ 
onal  elements  of  T1  are  equal  to  The  non-diagonal  elements  of  T1  are  chosen  such  that  Vq,.  . . ,  V2L-1  have 
covariances  identical  to  the  covariances  of  the  corresponding  Uq,  . . . ,  U2L-1  variables.  The  latter  covariances 
are  [19]: 


Cov 


Uj,  Uk 


(9.50) 
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Fig.  9.5.  Recursive  approximation  of  the  coarse-synchronization-error-rate  of  SMI-MVDR-type 
receivers  of  order-Z,  and  order-21,  given  by  (9.35)-(9.37)  (N  =  55  and  80,  respectively). 


To  find  the  covariances  %pjk  of  Yj  and  Yj:,  j  ^  k,  that  result  in 


Cov  [Vj,  Vk }  =  - 


ailnll  Raiinll 


(9.51) 


we  substitute  (9.49)  in  (9.51)  and  make  use  of  the  moment  theorem  to  evaluate  E{Y?Y fc3}.  Then,  we  find 
that  the  (j,  A:)th  non-diagonal  element  of  T1,  ipjk,  must  satisfy 


o  in/  2n2(2  (A-  181V  -811V2)2  ,  2 

Hjk  +  18(1  -  gjy)  ^jk  +  9 - diFHFI - L^jk  = 


(dijon  Rdn 


65617V4 


^(dSiRdS^dKiRd  S,) 


or  equivalently 


where  a  = 


o  0/l  2,2,2  _  (4  —  187V  —  817V2)2  1 

'‘Pjk  +  (  -  div)  ^ik  r;  v  rkci  at 4  ^jk  ~  3Na  ~ 


(9.52) 


(9.53) 


'dl|0||  *“11011 J 


(d 


uv 


Rd 


0)  stid® 

Rd  ,,G,| 


97V'  T3K  ’  ~  6  x  65617V4 

-tit—.  By  the  Schwarz  inequality  a  £  (0,1).  Equation  (9.53)  has  three  real 
*4oi) 


roots,  two  of  which  are  always  negative,  provided  that  TV  >  4. 

The  third  root  is  plotted  in  Fig.  9.8  as  a  function  of  TV  and  a.  We  see  that  the  third  root  is  always 
positive  while  it  drops  rapidly  to  zero  as  TV  increases.  In  fact,  for  TV  >  40  we  see  that  ipjk  is  less  than  0.0045. 
Thus,  we  may  assume  that  ijjjk  =0,  j  ^  k,  j,  k  =  0, . . . ,  2L  —  1.  In  other  words,  we  can  safely  assume  that 
the  Gaussian  random  variables  Yj,  j  =  0, . . . ,  2Z,  —  1,  are  uncorrelated  and,  therefore,  independent.  □ 


An  alternative  approximation  of  Pcse 

The  probability  of  coarse  synchronization  error  can  be  approximated  by  (cf.  (9.14)) 

Pcse  ~  1  -  Pr [arg rnax{  Zj  \  j  =  0, . . . ,  2L  -  1}  6  'R(ro)]  (9.54) 
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Order-L  (N=55) 


Fig.  9.6.  Coarse-synchronization-error-rate  of  linear  receivers  of  order  L  and  2 L  as  a  function  of 
the  SNR  of  the  user  of  interest  ( N  =  55  and  80,  respectively). 


where  R(to)  is  the  set  that  contains  the  indices  of  the  filters  that  yield  a  timing  estimate  within  the  pull-in 
range.  The  random  variables  Zj,  j  =  0, . . . ,  2L  —  1,  are  independent  Gaussian  with  mean  Ly  and  variance 
C(N)Uj  where  the  function  C(N)  is  given  in  Table  9.3  for  each  of  the  receivers  under  consideration.  The 
pdf  of  the  random  vector  Z  =  [Z0, . . . ,  Z2£_1]T  is 


/z(z)  = 


(2 n)L  II “o : 1  \/c(N)U]/3 


exp  < 


-£ 


y1  (g  -  vf)1 

%  2  C(7V)C/2/3 


where  z  =  [zG, . . . ,  z2l- i]t-  Thus, 


Pcae  —  1  /  fz(z)dz 

JV0 


where 


Vo=  1J  {(d0,di,. . .  ,d2L-i)  S  5R2i  with  dk  >  dj,  j  =  0, . . .  ,2L  -  1,  j  ±  A:}  . 

k€U{T0) 


(9.55) 


(9.56) 

(9.57) 
□ 
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Processing  gain  L 


Fig.  9.7.  Coarse-synchronization-error-rate  of  linear  receivers  of  order  L  as  a  function  of  the  pro¬ 
cessing  gain  L. 


N 


Fig.  9.8.  Positive  root  of  (9.53)  as  a  function  of  a  and  data-record-size  N. 
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Chapter  10 


Spatial  and  Temporal  Processing  for 
Global  Navigation  Satellite  Systems: 
The  GPS  Receiver  Paradigm 

10.1  Introduction 

The  Global  Positioning  System  (GPS),  originally  developed  for  military  use,  has  received  a  lot  of  attention 
recently  for  use  in  civilian  applications  such  as  aviation,  agriculture,  land-vehicle  navigation,  surveying  and 
mapping,  to  name  a  few  [1]- [5] .  The  GPS  satellite  navigation  system  comprises  the  space  segment,  the 
control  segment  and  the  user  segment.  The  space  segment  refers  to  the  constellation  of  24  GPS  satellites  out 
of  which  six  to  eight  are  usually  in  simultaneous  view  by  a  GPS  receiver.  The  control  segment  is  responsible 
for  the  management  and  monitoring  of  the  GPS  system  operations.  It  involves  monitor  stations  and  ground 
antennas  that  generate  and  upload  navigation  data  to  each  GPS  satellite.  The  user  segment  refers  to  receiver 
related  activities  such  as  estimation  of  the  receiver  position,  velocity  and/or  acceleration. 

The  GPS  system  employs  direct-sequence  spread-spectrum  (DS-SS)  signaling  [6]- [8]  and  utilizes  two 
carriers,  namely,  the  Li  carrier  at  1575.42  MHz  and  the  L2  carrier  at  1227.6  MHz.  Each  satellite  is  assigned 
a  coarse  acquisition  (C/A)  code  and  a  precision  ( P )  code.  Both  types  of  codes  belong  to  the  general  class  of 
product  codes,  i.e.,  they  are  formed  by  the  product  of  two  equal  period  pseudorandom  noise  (PN)  codes.  The 
C/A- code  is  a  Gold  sequence  with  chipping  rate  at  1.023  Mchips/sec  and  period  1  msec  (i.e.  code-length  1023 
chips),  while  the  P-code  is  a  PN  code  with  chipping  rate  at  10.23  Mchips/sec  and  truncated  period  of  one 
week.  The  Li  carrier  involves  both  C/A-signals  and  P-signals  while  the  L2  carrier  involves  only  P-signals. 
The  use  of  P-signals  is  limited  to  authorized  DoD  users.  On  the  other  hand,  C/A-signals  are  available  for 
civilian  applications  and  are  critical  in  military  applications  as  well,  since  their  demodulation  facilitates  the 
acquisition  of  P-signals.  The  C/A  and  P  code  are  modulated  by  binary  navigation  data  (at  50  bps)  and  then 
multiplexed  in  phase  quadrature  to  form  the  satellite  signal.  In  this  chapter  we  focus  on  the  C/A  component 
of  the  transmitted  GPS  signal  (or  as  commonly  stated,  we  assume  perfect  separation  of  the  C/A  and  P-signal 
component  at  the  receiver). 

The  working  principle  of  the  GPS  system  is  simple:  The  position  of  a  GPS  receiver  is  modeled  as  a  four¬ 
dimensional  vector  (three  coordinates  correspond  to  the  spatial  position  of  the  receiver  and  one  is  related 
to  receiver  timing).  Estimation  of  the  position  can  be  achieved  by  utilizing  the  signals  of  a  minimum  of 
four  satellites  (or  equivalently,  by  solving  a  system  of  four  equations  with  respect  to  four  unknowns,  namely 
the  three  spatial  position  coordinates  and  the  receiver  clock  bias).  If  additional  receiver  parameters  such  as 
velocity  and/or  acceleration  are  of  interest  then  a  higher  dimension  vector  is  required  to  model  the  system. 

The  signal  captured  by  a  GPS  receiver  is  the  aggregate  of  the  GPS  signals  of  the  satellites  that  are 
currently  in  view,  their  multipaths,  additive  white  Gaussian  noise  (AWGN)  due  to  front-end  receiver  elec¬ 
tronics,  possible  intelligent  hostile  spread-spectrum  interference  (spoofing),  other  wideband  and  narrowband 
interference  (we  note  that  the  only  type  of  wideband  interference  considered  in  this  work  is  SS  interference 
while  narrowband  interference  is  not  considered  at  all;  extensive  treatment  of  the  problem  of  narrowband 
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interference  suppression  can  be  found  in  [9]- [14]).  The  component  of  the  received  signal  that  is  due  to  the 
GPS  signals  of  the  satellites  currently  in  view  is  the  superposition  of  very  low  correlated  SS  signals  due  to 
the  fact  that  Gold  sequences  are  utilized  [15],  [16].  We  recall  that  the  cross-correlation  of  the  C/A- codes 
assumes  three  values,  namely  jAg,  pUp  and  A|g,  all  related  to  shifts  at  integer  multiples  of  the  chip  dura¬ 
tion.  However,  an  intelligent  SS  interferer/spoofer  who  knows  the  satellite  C/A-code  can  synchronize  to  the 
satellite  signal  of  interest  (at  its  own  location),  mimic  the  satellite  transmission  and  thus  severely  corrupt 
the  signal  of  interest  by  contributing  highly  correlated  (with  the  signal  of  interest)  SS  interference.  Indeed, 
simple  geometric  calculations  can  show  that  if  the  difference  of  the  direction-of-arrival  (DoA)  of  the  jamming 
signal  and  that  of  the  satellite  signal  is  less  than  0.27T,  then  as  long  as  the  spoofer  is  within  1500m  from  the 
GPS  receiver,  the  jamming  SS  signal  lags  the  satellite  signal  of  interest  by  a  fraction  of  a  chip  period.  The 
latter  condition  can  result  in  high  signature  correlation  between  the  jamming  SS  signal  and  the  SS  satellite 
signal  of  interest  at  the  receiver  (we  recall  that  the  auto-correlation  of  integrate-and-dump  Gold  SS  signals 
for  lags  less  than  a  chip  period  can  be  high)1. 

Demodulation  of  the  navigation  data  of  the  satellites  currently  in  view  can  be  achieved  either  by  utilizing 
a  bank  of  single-satellite  detectors,  or  by  using  a  fixed  decision  device  designed  to  simultaneously  detect  the 
data  of  a  number  of  specific  satellites  [1],  [17],  [18].  The  former  receiver  architecture  appears  better  suited 
to  the  GPS  environment  since  the  number  of  satellites  in  view  changes  with  time. 

In  the  presence  of  highly-correlated  SS  interference,  adaptive  temporal  processing  may  not  be  sufficient 
by  itself  to  achieve  satisfactory  interference  suppression.  The  use  of  an  antenna  array  allows  the  receiver 
to  operate  in  the  spatial  domain  in  addition  to  the  temporal  (code)  domain  and  hence  exhibit  improved 
interference  suppression  capabilities.  On  the  other  hand,  there  are  cases  where  interference  suppression  can 
be  achieved  through  adaptive  spatial  processing  only  (followed  or  preceded  by  temporal  matched  filtering). 
However,  when  cost  and  size  requirements  limit  the  degrees  of  freedom  provided  by  the  antenna  array, 
then  adaptive  spatial  processing  alone  might  not  be  sufficient  for  effective  interference  suppression.  In  such 
cases  adaptive  temporal  processing  in  addition  to  spatial  processing  is  required  (either  jointly  or  disjointly 
optimized).  In  this  chapter,  we  address  the  problem  of  navigation  data  demodulation  by  an  adaptive  GPS 
receiver  that  utilizes  a  bank  of  single-satellite  linear  tapped-delay  filters  and  employs  antenna-array  reception 
[19],  [20];  we  investigate  disjoint- domain  as  well  as  joint-domain  space-time  GPS  signal  processing  techniques. 

The  receiver  processors  considered  in  this  work  are  in  the  form  of  conventional  matched-filters  (MF) 
[21],  [22],  as  well  as  interference  suppressing  minimum- variance-distortionless-response  (MVDR)  filters  [23], 
[24]  and  auxiliary- vector  (AV)  filters  [25],  [26].  The  rationale  behind  the  selection  of  the  above  processors 
is  as  follows.  MVDR  filtering  is  a  generalization  of  minimum-mean-square-error  (MMSE)  filtering  (the 
former  reduces  to  the  latter  when  the  constraint  vector  is  chosen  to  be  the  statistical  cross-correlation 
between  the  filter  input  and  the  desired  response).  Thus,  the  MVDR  treatment  covers  MMSE  filtering  as  a 
special  case.  On  the  other  hand,  AV  filters  provide  a  viable  solution  when  filter  estimation  with  short  data 
records  is  performed.  We  recall  that  the  AV  algorithm  provides  a  sequence  of  filters  that  converges  to  the 
linear  MMSE/MVDR  optimum  solution  for  any  positive  definite  input  auto-correlation  matrix.  When  filter 
estimation  with  short  data  records  is  performed,  the  early,  non-asymptotic  elements  of  the  sequence  of  AV 
filter  estimators  provide  favorable  bias/variance  balance  and  outperform  in  mean-square  filter  estimation 
error  (constraint)  LMS,  RLS,  sample-matrix-inversion  (SMI),  diagonally  loaded  SMI  (DL-SMI),  and  multi¬ 
stage  nested  Wiener  filter  estimators  [27].  Maximum- likelihood  (ML)  optimum  receivers  are  not  considered 
in  our  studies  because  they  require  information  that  is  unknown  to  the  receiver  (i.e.  signatures  of  all 
active  SS  interfering  signals  and  all  energy  terms)  and  they  exhibit  unacceptable  computational  complexity 
(exponential  in  the  number  of  active  SS  signals).  Space-time  processing  in  the  context  of  direct-sequence 
code-division-multiple-access  (DS-CDMA)  communications  with  symbol-rate  periodic  spreading  has  been 
extensively  studied  in  the  literature  (a  representative  survey  chapter  is  [28]).  In  this  present  work,  the 
proposed  GPS  receiver  structures  utilize  filters  that  operate  at  a  fraction  of  the  navigation  data  bit  period  (1 
msec)  and  are  followed  by  hard-decision  detectors.  Soft  pre-detection  measurements  or  hard  post-detection 
decisions  taken  over  a  navigation  data  bit  period  are  then  combined  according  to  a  simple  combining  rule 
for  further  bit-error-rate  (BER)  performance  improvements. 

The  focus  of  this  chapter  is  on  interference  resistant  GPS  receivers.  The  performance  metrics  associated 
with  interference  resistance  characteristics  are  receiver  output  SINR  and  BER.  Accurate  data  demodulation 

*We  note  that  if  the  distance  parameter  value  is  less  than  1500m  then  even  higher  DoA-separation  parameter 
values  can  result  in  highly  correlated  jammers  with  the  GPS  signal  of  interest. 


141 


is  fundamental  to  the  estimation  of  the  location  of  the  GPS  receiver  since  both  satellite  position  information 
and  time  stamps  are  extracted  from  the  received  navigation  message.  A  GPS  message  is  transmitted  in  data 
frames.  Each  data  frame  consists  of  five  subframes  while  each  subframe  contains  ten  30-bit  words.  In  each 
data  frame,  the  first  subframe  provides  information  for  clock  correction  while  the  second  and  third  subframes 
provide  ephemeris  data  which  are  used  for  the  calculation  of  the  satellite  position.  The  probability  that  either 
the  satellite  position  estimate  or  the  timing  estimate  is  in  error,  Pr,  can  be  assessed  via  the  word  error  rate 
in  the  first  three  subframes.  Accounting  for  the  effects  of  block  coding  ((32,  26)  Hamming  block  codes  are 
used  by  current  GPS  systems),  then  the  30- bit  word  error  rate,  Pw,  can  be  evaluated  (approximately)  by 
Pw  ft  1  —  (1  —  Pe) 30  —  30Pe(l  —  Pe)29,  where  Pe  denotes  BER.  Then,  the  probability  that  the  range  estimate 
is  in  error  is  given  by  Pr  =  1  —  (1  —  Pw)30.  This  implies  that  when  Pe  =  0.01,  Pr  ft  0.67,  while  when 
Pe  =  0.1,  Pr  ft!  1.  Clearly,  Pr  is  a  global  metric  that  indicates  the  effect  of  BER  on  the  error  event  of 
the  range  estimation  process.  To  further  quantify  this  error,  detailed  analysis  is  needed  that  associates  the 
receiver  output  SINR  with  some  performance  metric  (e.g.  variance)  of  the  range  estimator.  Such  analysis  is 
beyond  the  scope  of  our  work. 

The  rest  of  this  chapter  is  organized  as  follows.  The  signal  model  is  given  in  Section  10.2.  Disjoint 
space-time  processing  algorithms  are  presented  in  Section  10.3  and  are  followed  by  joint  domain  processing 
algorithms  in  Section  10.4.  Combining  techniques  and  multipath  fading  phenomena  are  addressed  in  Sec¬ 
tion  10.5  and  Section  10.6,  respectively.  In  Section  10.7  filter  estimation  issues  are  considered.  Numerical 
and  simulation  results  are  given  in  Section  10.8  and  some  concluding  remarks  are  drawn  in  Section  10.9. 


10.2  Signal  Model 

The  C/A  and  the  P-signal  component  that  modulate  the  L\  carrier  are  assumed  to  be  perfectly  separated 
at  the  receiver.  Then,  the  signal  model  related  to  the  aggregate  C/A  component  of  the  received  signal  can 
be  viewed  as  an  asynchronous  DS-SS  system  with  I\  SS  signals  in  the  presence  of  AWGN.  Since  the  satellite 
navigation  information  bit  rate  is  50  bps,  it  is  equivalent  to  say  that  the  C/A  code  of  each  GPS  signal  repeats 
itself  20  times  per  bit  and  during  this  period  it  is  modulated  by  the  same  data  bit.  In  this  context,  the 
contribution  of  the  fcth  transmitted  GPS  signal,  A:  =  1,  2,  •  •  •  ,  I\,  over  T  =  1  msec  is  given  by 

Mt)  =YJb^)\fEksk{t-iT)e^^t+^  (10.1) 


where  bk(i)  6  {  —  1,  +1}  is  the  ith  transmitted  data  bit  such  that  bk(i)  =  bk(i  +  to),  m  =  1,  2,  •  •  •  ,  19,  and 
bk(i)  is  independent  of  bk(j)  for  all  I*  ~j I  >  20.  Also,  Ek  and  4>k  denote  the  energy  and  the  carrier  phase, 
respectively,  of  the  kth  signal,  while  Sk  is  the  normalized  fcth  C/A  code  given  by 

L—l 

Sk(t)  =  ^2  dk(l)ip(t  -  ITC).  (10.2) 

1=0 


In  (10.2),  <4(0  G  {±1  /VL},  l  =  0, 1,  •  •  •  ,  L  —  1,  are  the  signature  chips  of  the  fcth  SS  signal  and  ip(t)  is  the 
chip  waveform  of  duration  TC  =  T / L. 

The  aggregate  signal  received  at  the  input  of  a  narrowband  uniform  linear  array  of  M  antenna  elements 
is  given  by 

K 

'x-c{t)  =  'Y^uk(t-Tk)sLk  +  n(t)  (10.3) 

k= 1 

where  n(0  is  M-dimensional  complex  AWGN,  rk  is  the  user  time  delay  and  ak  is  the  array  response  vector 
of  the  fcth  SS  signal  with  elements 


aJm]  =  ei27r(m“1)! 


to  =  1,  2,  •  ■  *  ,  M. 


(10.4) 


In  (10.4),  6k  denotes  the  angle  of  arrival  of  the  fcth  signal,  A  is  the  carrier  wavelength  and  d  is  the  inter¬ 
element  spacing  (usually  d  =  A/2).  For  simplicity  of  presentation,  multipath  considerations  and  related 
signal  model  generalizations  are  deferred  to  Section  10.6. 
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After  carrier  demodulation  the  received  signal  is  given  by 

K 

*(‘)  =  EE  bk (^) y/ Xiksk (t  %T  Tk^c  ^  ^  ak  T  n(A).  (10.5) 

i  k=l 

Without  loss  of  generality,  we  assume  chip  synchronization  at  the  reference  antenna  element  ( m=l )  with 
the  SS  signal  of  interest,  say  signal  1 ,  and  we  also  assume  0  <  Tk  <  T,  A:  =  2, 3,  •  •  •  ,  K.  After  conventional 
chip  matched  filtering  and  sampling  at  the  chip  rate  1  /Tc,  we  can  visualize  the  space-time  data  samples 
associated  with  b\(i)  in  the  form  of  an  M  x  L  matrix  X 

XHxi(»)  =  [xMxi(*i)  xMxl(iL+l)  •••xMxl(iL+L-l)]  (10.6) 

where  the  column  vector  xMxl(iL  +  j)  associated  with  the  jth  chip-sample  within  the  ith  information  bit 
period  of  signal  1  is  given  by 

K 

Xmxi(*£+j)  =  ^2h{i)V^kdk(j-Tk/Tc)e~j27rfaTkak  + 
k= 1 

nMxi {iL+j),  j  =  0, 1,-  -  •  ,L  -  1  (10.7) 


or  equivalently, 

xMxi(*^+i)  =  +  IMxl(iL+j)  + 

nMxi (iL+j),  j  =  0, 1,  •  •  •  ,L  —  1  (10.8) 

where  IMxl(iL  +  j)  =  ^k=2  bk ( i)\/Ekdk ( j  —  rk/Tc )  e~:’2n^aTkak  denotes  comprehensively  multiple  access 
interference  (MAI)  and  nMxl(zL+j)  is  an  AWGN  noise  vector  with  covariance  matrix  cr2XMxM  (IMxM  denotes 
the  MxM  identity  matrix).  In  the  following,  we  pursue  one-shot  detection  of  the  bit  of  interest  bi(i)  and 
we  drop  the  index  i  for  simplicity  in  notation. 

We  note  that  under  the  signal  model  characteristics  presented  in  this  section,  as  well  as  the  GPS  receiver 
architecture  adopted  in  this  work  (a  bank  of  single-satellite  detectors),  the  focus  is  on  the  detection  of  the 
navigation  data  of  a  single-satellite.  In  this  context  and  for  the  single-satellite  detector  under  consideration, 
the  GPS  signals  of  other  satellites  currently  in  view  as  well  as  possible  intelligent  SS  “GPS-looking”  jamming 
signals  can  be  comprehensively  incorporated  into  the  SS  interference  term  I.  Besides  the  power  advantage 
of  spoofing  signals  over  satellite  signals,  the  difference  between  the  former  and  the  latter  SS  signals  lies 
in  their  corresponding  signature  cross-correlation  level  with  the  satellite  signal  of  interest  (high  and  low 
correlated  SS  interference,  respectively).  To  justify  the  above  observation  it  suffices  to  recall  that  Gold 
sequences  have  low  cross-correlation  while  the  auto-correlation  of  integrate-and-dump  Gold  SS  signals  for 
lags  less  than  one  chip  period  can  be  high.  As  an  example,  suppose  now  that  the  distance  between  the 
satellite  and  the  GPS  receiver  is  h  =20162  km,  the  distance  between  the  jammer  and  the  GPS  receiver  is 
r,  and  the  angle  between  the  DoA  of  the  jamming  signal  and  the  DoA  of  the  satellite  signal  is  A 9.  Then, 
by  simple  geometric  calculations,  we  can  evaluate  the  distance  between  the  satellite  and  the  jammer  by 
l  =  \Jh2  +  r2  —  2hrcos(A9).  If  Ad  <  0.27T,  then  l  +  r  —  h  <  300  m  as  long  as  r  <  1500m,  which  implies 
that  the  jamming  signal  arrives  at  the  GPS  receiver  within  Tc  time  after  the  direct  satellite  signal.  Thus, 
the  jammer  contributes  strong  interference  from  as  far  away  as  1500m  from  the  GPS  receiver. 


10.3  Disjoint  Domain  Filtering 

In  this  section  we  investigate  several  disjoint  domain  filtering  configurations  for  GPS  signal  processing  that 
are  formed  by  the  cascade  of  a  spatial  filter  followed  by  a  temporal  filter,  or  the  cascade  of  a  temporal 
filter  followed  by  a  spatial  filter.  In  this  context,  disjoint  domain  receiver  design  is  a  two  stage  process 
with  the  second  stage  being  conditioned  on  the  design  of  the  first  stage.  The  design  optimization  criterion 
imposed  at  either  stage  (regardless  of  the  spatial  or  temporal  nature  of  the  corresponding  filter)  that  is 
of  interest  in  this  work  is  of  MF-type,  MVDR-type  or  AV-type.  The  design  optimization  criterion  for  each 
stage  can  be  selected  independently  and  is  usually  dictated  by  considerations  of  simplicity  in  implementation, 
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computational  complexity  and  performance.  On  the  other  hand,  the  space-time  or  time-space  order  by  which 
a  signal  is  processed  may  be  influenced  by  the  objective  of  a  particular  application.  We  may  expect,  for 
example,  that  in  cases  where  DoA  estimation  is  critical,  a  temporal-domain  processed  received  signal  is  less 
interference-corrupted  than  the  non-processed  received  signal  and  thus  it  can  provide  a  better  input  to  a 
DoA  estimation  algorithm  (whose  output  may,  in  turn,  be  needed  for  subsequent  spatial-domain  processing). 


10.3.1  Space-Time  (S-T)  Configuration 

The  general  structure  of  the  space-time  cascade  receiver  configuration  is  shown  in  Fig.  10.1.  For  an  arbitrary 
linear  spatial  processor  ws  and  an  arbitrary  linear  temporal  processor  wt  the  decision  on  the  information 
bit  of  the  signal  of  interest  bi  is  given  by  the  following  expression, 

b\  =  sgn  (Re{ wf  (wf  X)T})  =  sgn  (Re{wf  Xw('})  (10.9) 

where  sgn(-)  identifies  the  sign  operation  (0-threshold  hard  limiter),  superscript  H  denotes  the  Hermitian 
operator,  and  i?e{- }  extracts  the  real  part  of  a  complex  number.  We  recall  that  X  is  the  complex  data 
matrix  of  size  M  x  L.  In  the  following  we  present  the  different  forms  that  ws  and  wf  may  assume. 

Spatial  Matched-Filtering  (sMF) 

The  spatial  matched  filter  is  the  filter  matched  to  the  array  response  vector,  ai,  of  the  signal  of  interest, 
that  is 

w3MJt  =  ai/M.  (10.10) 

We  observe  that  -EbjfX&idi}  =  yTiai;  where  di  is  the  signature  of  the  signal  of  interest  and  the  statistical 
expectation  operation  E{-}  is  taken  with  respect  to  bi  only.  Although  estimation  of  the  array  response  vector 
is  not  the  primary  subject  of  this  work,  we  notice  that  the  above  expectation  implemented  in  the  form  of 
sample  averaging  can  serve  as  a  supervised  estimator  of  the  array  response  vector  of  the  signal  of  interest: 

1  ^ 

Si  =  jEXA(i)di  (10.11) 

3= 1 

where  {Xj}f=1  is  a  sequence  of  J  M  x  L  matrices  of  received  data.  We  recall  that  any  subsequent  linear 
filtering  followed  by  0-threshold  hard  limiter  is  positive  scalar  invariant  and  thus  any  scalar  that  multiplies 
ai  does  not  affect  the  final  output  performance.  We  also  note  that  the  scalar  1/M  in  (10.10)  preserves 
the  energy  of  the  signal  of  interest  at  the  output.  The  latter  is  merely  a  convenience  consideration  for  the 
performance  analysis  and  comparisons  presented  in  subsequent  sections. 

The  output  of  the  filter  weMF  is  given  by 

yMf  =  =  [Vmf,1  Umf,  2  ■■■  Vmf,l\  (10.12) 

Vmfj  =  wf L*-Xmx1(j)  =  \/rEib1d1(j)  +  af  I  (j)/M 

+afn (j)/M,  j  =  1,2,  ■■  ■  ,  L.  (10.13) 


□ 

Spatial  MVDR  Filtering  (sMVDR) 

The  MVDR  filter  is  designed  to  minimize  the  filter  output  variance  subject  to  the  constraint  that  the  filter 
remains  distortionless  in  the  vector  direction  of  the  signal  of  interest  ai  [23].  The  filter  is  given  by 


W  ,MV 


R-s  lai 

af  R7xai 


(10.14) 


where  Rs  denotes  the  M  x  M  covariance  matrix  of  the  columns  of  XMxI/  (i.e.  the  correlation  of  the  spatial 
input  data).  The  output  of  the  MVDR  filter  is  given  by 

T  _  H  v 
y  MV  ^sMV^xL 

=  [|/mv,1  Vmv,  2  ***  Vmv,l\  (10.15) 
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where 


V  MV,  j  —  WsMVXMxl  O')  =  y/ E\b\d\  (j)  + 


afRr1!^) 

af  R«_1ai 


gf glVj) 

a^RJ1a1  ’ 


i  =  1,2,- 


,L. 


(10.16) 


□ 


Spatial  Auxiliary-Vector  Filtering  (sAV) 

For  a  given  spatial  covariance  matrix  Rs,  the  theory  of  AV  filtering  [26],  [27]  can  be  applied  to  the  spatial 
domain  and  provide  an  infinite  sequence  of  spatial  linear  filters  that  are  distortionless  in  the  direction  of  the 
signal  of  interest  and  can  be  obtained  by  the  following  recursion: 


"W(0)=IMI2 

(10.17) 

For  n=  1,  2,  ■  ■  ■ 

,  N  -D  ,  ,,  afR5wsAV(n-l)a1 

gs(n)  =  TlswsAV(n  1) 

llai|| 

(10.18) 

..  ^  _  gf  (n)RsvfaAV(n-l) 

/,S  n  g? (n)Rsgs(n) 

(10.19) 

wSj4 v  (n)  =  wSj4 v  (0)  -  ^2  Ts  (i)gs  (■ i ) 

(10.20) 

In  the  above  recursive  procedure,  the  sequence  of  auxiliary  vectors  g s(n)  is  selected  to  maximize  the  cross¬ 
correlation  between  the  wSj4v(n— 1)  processed  data  and  the  g s(n)  processed  data  while  pn  is  chosen  to 
minimize  the  variance  at  the  output  of  the  filter  w sAV(n).  It  is  important  to  note  that  all  auxiliary  vectors 
gs(l),  gs(2),  •  •  •  are  constrained  to  be  orthogonal  to  ai  but  orthogonality  among  them  is  not  imposed.  As 
shown  in  [27],  the  sequence  of  auxiliary- vector  filters  converges  to  weMV  (wbAV(ji)—. >wsMV  as  n— too).  The 
AV  filter  output  when  n  auxiliary  vectors  are  utilized  is  given  by 

Yav  =  wlWXMxf  (10.21) 


□ 

Finally,  the  temporal  processor  of  the  space-time  configuration  is  a  linear  filter  of  dimension  L  x  1 
that  takes  as  input  the  spatial-processor  output  (given  by  (10.12)  or  (10.15)  or  (10.21)).  Similar  to  the 
optimization  criteria  used  in  the  design  of  the  spatial  processor,  the  temporal  filter  can  be  of  MF-type, 
MVDR-type  or  AV-type. 

Temporal  Matched-Filtering  (tMF) 

The  temporal  MF  is  given  by 

wtMF  =  di  (10.22) 

or  equivalently,  w tMF  =  Eb1{ybi},  where  the  temporal  filter  input  y  is  either  y  =  yMF  or  y  =  yMV  or 

y  =  y^v-  □ 

Temporal  MVDR-Filtering  (tMVDR.) 

The  temporal  MVDR  filter  can  be  shown  to  be  equal  to 


WtMF 


Rr'd, 

dfR-'d, 


(10.23) 


where  Rt  is  the  L  x  L  covariance  matrix  of  the  spatial  filter  output  y  and  thus  it  depends  on  the  type  of 
first-stage  spatial  processing.  □ 
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Temporal  Auxiliary -Vector  Filtering  (tAV) 

The  sequence  of  auxiliary  vector  filters  in  the  temporal  domain  can  be  obtained  as  follows, 


'ww(O)  =di 

For  n  =  1,  2,  -  •  • 


n  =  1,  2,  -  -  - 

g t(n)  =  Rtwtjlv(n-1)  -  df  Rtwtjtv(n-l)dj 
_  gf  (n)Rtwtjlv(n-l) 
gf(n)R ,St(n) 


W  ...  1  Til 


(10.24) 

(10.25) 

(10.26) 

(10.27) 


and  R*  denotes  the  covariance  matrix  of  the  spatial  filter  output  y  (which  again  depends  on  the  type  of  the 
first-stage  spatial  processor).  □ 


10.3.2  Time-Space  (T-S)  Configuration 

The  time-space  (T-S)  cascade  configuration  is  the  dual  alternative  to  the  space-time  cascade  configuration 
examined  in  the  previous  section.  The  general  T-S  cascade  structure  is  depicted  in  Fig.  10.2  where  zMxl 
and  y  denote  the  output  of  the  temporal  and  spatial  filter  wt  and  ws,  respectively.  Both  filters  can  be 
designed  according  to  MF,  MVDR  or  AV  processing  principles  in  a  dual  manner  to  the  S-T  configuration  of 
the  previous  section.  It  is  straightforward  to  show  that  the  S-T  configuration  sMF /tMF  is  equivalent  to  the 
T-S  configuration  tMF /sMF.  This  is  not,  however,  the  case  when  MVDR  or  AV  filters  are  involved. 

With  all  these  design  alternatives  available  for  both  disjoint  S-T  and  T-S  configurations,  a  natural 
question  arises  as  to  what  their  relative  merits  are.  We  investigate  this  issue  in  the  following  section. 


10.3.3  Performance  Analysis  of  Disjoint  Processing  Schemes 

For  convenience  in  presentation  we  study  a  hypothetical  system  consisting  of  only  two  spread  spectrum 
signals,  say  one  GPS  signal  of  interest  and  one  “GPS-looking”  jamming  signal.  Our  objective  is  to  draw 
theoretical  conclusions  about  the  relative  merits  of  all  different  design  alternatives  presented  in  sections 
10.3.1  and  10.3.2.  The  performance  criterion  that  we  consider  for  a  given  design  is  the  signal-to-interference- 
plus-noise-ratio  (SINR)  at  the  output  of  the  overall  configuration.  The  results  can  be  generalized  to  account 
for  the  presence  of  more  GPS  and  jamming  signals.  However,  the  mathematical  expressions  involved  are 
prohibitively  complicated  with  the  latter  usually  at  the  expense  of  clarity  in  presentation.  The  analysis  that 
follows  involves  performance  comparisons  for  MF-type  and  MVDR-type  filter  combinations  only.  Disjoint 
auxiliary-vector  filtering  analysis  renders  itself  quite  complicated  and  thus  it  is  not  considered  in  this  work. 
The  comparisons  are  summarized  in  the  following  theorem.  The  proof  is  included  in  the  appendix. 

Theorem  10.1  Let  a.;  and  d;  be  the  spatial  and  temporal  signature  of  signal  i,  i  =  1,  2,  of  length  M  and  L, 
respectively.  Let  p  and  p  be  defined  as  p  =  |d^d2|,  and  rj  =  aijv^2  ,  respectively.  Let  also  r*  0  denote  the 

input  signal-to-noise-ratio  (SNR)  of  signal  i,  and  SINR(sXX/tYY)  or  SINR(tYY /  sX  x)  denote  the  output  SINR. 
of  an  S-T  or  T-S  configuration,  respectively,  that  utilizes  an  xx-type  spatial  filter  and  a  YY-type  temporal 
filter.  Then, 

A.  Space-Time  configuration 

(i)  SINRfsMF/tMF)  <  SINR(sMF/tMv) ,  with  equality  if  and  only  if  p  E  {0,1}  or  r/  =  0, 

(ii)  SINR(sMV/tM f)  <  SINR(sMV/tMv) ,  with  equality  if  and  only  if  p  £  {0, 1}  or  p  =  0, 

(Hi)  SINR.(sMF/tMF )  >  SINR(sMV/tMF)  if  and  only  if 

772(1  -  V2)  [1  -  MT2p2(l  -  r) 2)  -  2 Lp2]  >  0,  (10.28) 

(iv)  A  loose  sufficient  condition  for  SINR(sMV/tMv)  <  SINR(sMF/tMv)  is 

l  +  MT2p2(l-  p2)  > 
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2 


(10.29) 


Miyg-p2) 

+  1+^(2+a«j1)(i_7?2) 

(v)  SINR(sMF/tMv)  >  SINR(sMV/tMF)  if  and  only  if 

lVp2(i-p2)> 

(1— P2)  [Mr2p2(l-772)+2Lp2-l-772(l-p2)] . 


.+^(1V) 


B.  Time-Space  configuration 

(i)  SINR(tMF/sMF)  <  SINRftMF / sMv) ,  with  equality  if  and  only  if  p  E  {0, 1}  or  p  =  0, 

(ii)  SINR(tMV/sMF)  <  SINRftMV / sMv) ,  with  equality  if  and  only  if  r\  E  {0, 1}  or  p  =  0, 
(Hi)  SINR(tMF/sMF)  >  SINR(tMV/sMF)  if  and  only  if 

p2(  1  -  p2)  [1  -  MV2p2(l  -  p2)  -  2 Mp2]  >  0, 


(iv)  A  loose  sufficient  condition  for  SINR(tMV / sMv)  <  SINR(t.M f / sMv)  is 

1  +  MT2p2(l  -  rj2)  > 

MT2p2(l-p2)) 


1+ 


i+r2(2+r2)(i-p2) 


[i+r2(i— p2)] " 


(v)  SINR{tM f / sMv )  >  SINR(tMV / sM f)  if  and  only  if 

M2pV(l  -rj2)  > 

(i-p2)[Mr2p2(i-p2)+2Mp2-i-Mr2p2(i-p2)]- 


C.  Space-Time  versus  Time-Space  configuration 

(i)  SINR(sMF/tMF)  =  SINR(tMF/sMF),  for  any  p,p, 

(ii)  SINR(sMV/tM f)  <  SINR.(tMF/sMv),  with  equality  if  and  only  if  r)  E  {0, 1}  or  p  =  1  /\/L, 
(Hi)  SINRftMV/ sMf)  <  SINR(sMF/tMv),  with  equality  if  and  only  if  p  E  {0, 1}  or  p  =  1  / \/~M . 


(10.30) 


(10.31) 


(10.32) 


(10.33) 


□ 

A  few  observations  based  on  the  above  theorem  now  follow:  (a)  Under  the  assumption  of  AWGN  and 
interference  that  is  independent  of  the  signal  of  interest,  the  output  interference-plus-noise  variance  of  any 
cascade  configuration  of  linear  filters  above  can  be  expressed  as  the  sum  of  two  terms,  one  due  to  the 
interference  only  and  the  other  due  to  noise  only,  (b)  S-T  and  T-S  are  dual  configurations  (hence  results 
in  Part  B  are  simply  the  dual  of  results  in  Part  A,  so  in  the  following  we  address  Part  A  and  Part  C  only). 

(c)  When  p  =  0  or  1,  wtMV  degenerates  to  w tMF.  Similarly,  when  rj  =  0  or  1,  woMV-  degenerates  to  w3MJ?. 

(d)  The  output  SINR  increases  as  the  number  of  antenna  elements  increases,  (e)  An  MVDR  structure  at 
the  second  stage  suppresses  the  overall  output  interference-plus-noise  variance  while  an  MVDR  structure  at 
the  first  stage  followed  by  an  MF-structure  at  the  second  stage  suppresses  interference- plus-noise  variance  of 
that  first  stage  only,  (f)  We  understand  that  Item  (iii)  of  Part  A  may  appear  somewhat  unexpected  at  first 
glance.  To  add  an  intuitive  justification/explanation  of  the  validity  of  this  statement,  we  consider  a  simple 
example  where  p  =  0  and  0  <  rj  <  1.  In  this  case,  the  interference  is  completely  suppressed  after  temporal 
MF  of  the  second  stage  in  both  sMF/tMF  and  sMV/tMF  configurations.  However,  the  variance  component 
due  to  AWGN  only  is  higher  at  the  output  of  the  spatial  sMv  than  sMF  of  the  first  stage.  This  filtered  noise 
term  which  participates  in  the  input  to  t.MF  in  both  configurations  leads  to  the  output  SINR  relationship  of 
A(iii).  Similar  justification  can  be  provided  for  A(iv).  We  note  that  the  output  SINR.  of  the  MVDR  filter 
depends  on  the  input  SINR.  as  well  as  on  the  input  interference-to- noise  ratio  (INR).  In  other  words,  for  a 
given  set  of  SS  signal  signatures  and  a  given  input  SINR,  the  output  SINR.  of  the  MVDR.  filter  may  be  higher 
in  the  case  of  high  input  INR  than  in  the  case  of  low  input  INR.  A  complete  numerical  study  with  respect 
to  this  issue  is  carried  out  in  Fig.  10.3  for  a  GPS  system  with  the  signal  of  interest  at  15  dB  and  one  “GPS 
looking”  jamming  signal  at  30  dB.  Perfect  knowledge  of  the  pertinent  auto-covariance  matrices  is  assumed. 
We  observe,  for  example,  that  if  p  >  0.25  then  SINR(sMV/tMv)  <  SENR(sMF/tMv)  for  p  <  0.7. 
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10.4  Joint  Domain  Filtering 

For  joint  domain  processing,  to  avoid  cumbersome  2-D  operations  and  notation,  we  vectorize  the  matrix 
XHx[  by  stacking  all  columns  in  the  form  of  a  vector 

dWi  =  V  ec{XMx£,}.  (10.34) 

In  the  following,  X  denotes  the  joint  space-time  data  in  the  CML  complex  vector  space  that  constitutes 
the  input  to  a  joint  space-time  linear  filter  w  to  be  designed  according  to  MF,  MVDR  or  AV  processing 
principles.  The  general  structure  of  the  joint  space-time  receiver  is  depicted  in  Fig.  10.4. 

Joint  Domain  Matched-Filtering  (JMF) 

The  joint  space-time  matched  filter  for  the  signal  of  interest  ( signal  1 )  is  the  joint  space-time  signature  of 
the  signal  of  interest,  that  is,  the  Kronecker  product  vi  =  (di  (g>  ai)/M,  where  di  and  ai  are  the  temporal 
code  and  spatial  code  (steering  vector)  of  the  signal  of  interest,  respectively.  Similar  to  the  developments 
of  Section  10.3,  v(  is  also  given  as  the  cross-correlation  between  the  received  space-time  vector  X  and  the 
desired  information  bit  b\ ,  that  is, 


w jmf  =  vi  =  Ebl{Xbi}/M  =  (di  <S>  &i)/M 


(10.35) 


where  the  scaling  by  1/M  is  done  without  loss  of  generality  for  consistency  purposes  in  subsequent  theoretical 
and  simulation  comparisons. 

With  w jmf  defined  by  (10.35),  it  is  straightforward  to  see  that  the  JMF  filter  is  equivalent  to  the 
disjoint  MF /MF  configurations.  We  also  note  that  JMF  (and  the  equivalent  cascade  MF /MF  designs  for  that 
purpose)  is  mean-square  optimum  only  when  the  multiple  access  interference  and  the  external  disturbances 
are  white  Gaussian,  which  is  not  the  case  for  most  practical  SS  communication  systems.  Indeed,  non- 
orthogonal  multiple  access  interferers  as  well  as  highly  correlated  (with  the  signal  of  interest)  hostile  jammers 
may  render  the  JMF  receiver  ineffective.  A  remedy  for  the  latter  situation  is  to  proceed  with  the  design  of 
the  near-far  resistant  MVDR  filter  receiver  or  the  AV  filter  receiver  that  are  presented  below. 

Joint  Domain  MVDR  Filtering  (JMVDR) 

The  joint  domain  MVDR  filter  is  designed  to  minimize  the  variance  at  its  output  and  simultaneously  be 
distortionless  toward  vi,  the  joint  space-time  signature  of  the  signal  of  interest.  The  filter  is  given  by  the 
following  expression: 


WJMV 


R_1Vl 

v"R-'Vl 


(10.36) 


where  R  =  E{XXH}  is  the  covariance  matrix  of  the  space-time  data  input  vector. 

Joint  Domain  Auxiliary -Vector  Filtering  (JAV) 

Joint  domain  auxiliary- vector  filter  design  provides  a  sequence  of  joint  domain  auxiliary- vector  filters  that 
are  distortionless  toward  the  joint  space-time  signature  of  the  signal  of  interest  vi  and  can  be  obtained  by 
the  following  recursion: 


j^(°)-  ||Vi||2 

(10.37) 

n  =  1,2,--- 

/  \  -r)  ,  , .  vfRwJj4v(n-l)v! 

g(n)  =  Rwjwn-1 - i - n — - - — 

(10.38) 

gfl(n)RwJW(n-l) 

M n)  ~  SH (n)Rg(n) 

(10.39) 

jav  (n)  =  w  JAV  (0)  -  ^2  /•*(*) g(*) 

(10.40) 

Similar  to  the  corresponding  auxiliary-vector  filtering  developments  for  the  disjoint  configuration  of  Sec¬ 
tion  10.3,  the  joint-domain  auxiliary  vector  g  (n)  is  chosen  to  maximize  the  cross-correlation  Iw^^n— l)Rg(n)| 
while  the  scalar  fi(n)  is  chosen  to  minimize  the  output  filter  variance 

|w^4v(n)Rwjj4v(n)|,  n  =  1, 2,  -  -  - .  Convergence  analysis  of  the  auxiliary- vector  filter  sequence  can  be  found 
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in  [27].  Data- record-based  criteria  for  the  selection  of  the  best  AV  estimate  of  the  MVDR  filter  were  recently 
reported  in  [29]. 

As  a  final  comment  for  this  section,  we  note  that  while  disjoint  MF  processing  in  space  and  time  is  always 
equivalent  to  joint  MF  processing,  adaptive  joint-domain  (MVDR  or  AV)  processing  involves  ( ML  —  1) 
degrees  of  freedom  while  disjoint  processing  involves  (M  +  L  —  2)  [30].  As  a  result,  both  joint  MVDR 
and  joint  AV  processing  are  expected  to  outperform  the  corresponding  disjoint  configurations  that  involve 
MVDR  processing  or  AV  processing  in  both  space  and  time,  given  ideal  pertinent  statistics.  In  real  world 
applications,  however,  finite  data-record-size  puts  the  “curse  of  dimensionality”  on  joint-domain  processing, 
hence  we  may  find  disjoint-domain  configurations  outperform  the  joint-domain  counterpart  under  short  data 
support. 

10.5  GPS  Filter  Output  Combining 

Combining  techniques  have  been  extensively  studied  in  the  communications  literature  as  a  means  to  provide 
some  form  of  diversity  (a  representative  example  is  R  AKE  filtering  for  multipath  combining  [22],  [31]).  In  this 
section  we  take  advantage  of  the  redundancy  introduced  by  spreading  the  information  bit  into  20  C/A  code 
periods  and  implement  combining  methods  to  further  improve  the  receiver  BER  performance.  In  fact,  while 
in  initial  acquisition  and  demodulation  mode  this  feature  cannot  be  utilized,  once  bit  synchronization  has 
been  acquired  and  the  information  bit  boundary  has  been  detected  (usually  achieved  by  a  threshold  testing 
of  the  output  of  a  sliding  correlator),  we  can  combine  the  soft  or  hard  GPS  filter  outputs  by  using  either  (i) 
selective  combining  (SC)  or  (ii)  equal  gain  combining  (EGC)  or  (Hi)  maximum  ratio  combining  (MRC).  In 
SC,  we  combine  those  GPS  processor  outputs  whose  soft  (pre-detection)  outputs  are  above  a  pre-specified 
threshold.  Depending  on  the  variations  of  the  environment  and  the  velocity  of  the  receiver,  the  threshold 
needs  to  be  set  dynamically,  which  increases  significantly  the  complexity  of  the  method.  On  the  other  hand, 
EGC  is  the  simplest  method  where  all  20  GPS  filter  outputs  are  just  added  together  with  equal  gain  of  1. 
In  MRC,  the  GPS  filter  (soft  or  hard)  outputs  are  first  weighted  and  then  summed  such  that  each  weight  is 
proportional  to  the  corresponding  magnitude  of  the  GPS  filter  soft  output.  Adaptive  implementations  of  the 
MRC  method  that  appear  necessary  under  time  varying  conditions  of  the  environment  may  contribute  to 
additional  complexity.  As  mentioned  above,  all  these  methods  can  be  applied  to  either  the  soft  or  the  hard 
filter  outputs  with  the  understanding  that  pre-detection  combining  is  better  than  post-detection  combining 
in  terms  of  output  SINR. 

The  advantages  of  filter  output  combining  as  well  as  the  relative  SINR  merits  of  the  corresponding 
methods  described  above  appear  to  be  more  evident  under  fading  and  low  SNR  conditions.  To  illustrate  this 
and  as  a  precursor  to  our  simulation  studies,  we  may  consider  a  simple  scenario  where  due  to  fading  and  in 
the  absence  of  SS  interference  the  signal  component  at  all  but  one  (out  of  20)  filter  outputs  over  a  navigation 
data  bit  period  are  attenuated  by  the  multiplicative  constant  0.1.  Then  in  presence  of  AWGN  of  variance  <r2  a 
SC  method  that  selects  only  the  strongest  output  exhibits  output  SNR  equal  to  SNRsc  =  while  the  EGC 

method  and  the  MRC  method  exhibit  output  SNR,  respectively,  SNREGC  =  (Ae+w-oa-Ce)  _  q  ^2SNRsc 

and  SNRmrc  =  ( N^^1]9900112'ff ' ^  =  1.19 SNRsc,  where  E  is  the  energy  of  the  signal  component  at  the  filter 
output  under  no  attenuation  conditions.  Alternatively,  if  we  assume  that  due  to  fading  half  of  the  filter 
outputs  are  attenuated  by  the  multiplicative  constant  0.5,  then  SNRsc  =  SNREGC  =  11.25 SNRsc  and 
SNRmrc  =  12.5SNRSC. 

10.6  Multipath  Fading  Considerations 

Fading  phenomena  associated  with  mobile  satellite  communications  at  the  L-band  can  be  categorized  into 
long-term  fading  and  short-term  fading.  Long-term  fading  effects  in  signal  envelope  variations  are  modeled 
as  multiplicative  log-normal  random  variables  while  short-term  fading  effects  are  modeled  as  multiplicative 
complex  Gaussian  random  variables  that  introduce  a  Rayleigh  distributed  amplitude.  In  this  chapter  we 
consider  only  short-term  channel  variations  that  appear  more  difficult  to  accommodate. 

Short-term  multipath  fading  channels  involve  independent  fading  effects  across  all  resolvable  paths. 
The  presence  of  a  direct  path  (with  no  amplitude  distortion)  identifies  a  Rician  channel  while  its  absence 
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characterizes  the  channel  as  Rayleigh.  After  multipath  fading  “channel  processing”  the  aggregate  spread 
spectrum  signal  at  the  input  of  a  narrowband  uniform  linear  array  of  M  elements  is  given  by 

K  N 

*c(t)  Y]  ak,nUk(t  -  Tfc,n)afc;„  +  n(t)  (10.41) 

k= 1 n= 1 

where  N  is  the  total  number  of  resolvable  multipaths,  Tu,n  and  &k.n  are  respectively  the  delay  and  the  Mxl 
array  response  vector  for  the  nth  path  of  the  fcth  SS  signal,  and  otk,n ,  A:  =  1,  •  •  •  ,  K,  are  independent  zero- 
mean  complex  Gaussian  random  variables  that  model  the  fading  phenomena  and  are  assumed  to  remain 
constant  over  several  C/A  code  intervals.  We  recall  that,  the  fading  rate  is  the  maximum  Doppler  shift  in 
frequency  and  is  given  by  fm  =  vfo/c  Hz  where  v,  fo  and  c  denotes  the  velocity  of  the  receiver  (measured  in 
nr/sec),  the  carrier  frequency  (measured  in  Hz)  and  the  propagation  speed  (measured  in  m/sec),  respectively. 
As  an  example,  the  fading  rate  for  a  mobile  GPS  receiver  moving  at  35  mph  is  about  70  Hz,  which  implies 
that  the  channel  coherence  time  (the  reciprocal  of  fading  rate)  is  approximately  15  msec  while  the  information 
bit  period  is  20  msec.  If,  on  the  other  hand,  the  GPS  receiver  is  moving  almost  perpendicular  to  the  line  of 
sight  (e.g.  at  an  angle  89.5°)  then  the  channel  varies  at  a  frequency  fm  cos(89.5)  0.1  Hz. 

The  developments  presented  in  the  previous  sections  can  be  generalized  to  cover  multipath  fading  chan¬ 
nels.  In  particular,  the  filter  processors  (for  any  of  the  disjoint  or  joint-domain  configurations)  retain  the 
same  structure  (e.g.,  expressions  (10.10),  (10.14),  (10.20),  (10.22),  (10.23),  (10.27),  (10.35)-(10.40))  with 
the  understanding  that  the  spatial  “code”  a1;  the  temporal  code  d1;  and  the  space-time  code  dx  ®  aj  are 
substituted  by  their  multipath  fading  “channel  processed”  versions,  respectively.  Similarly,  corresponding 
covariance  matrices  are  appropriately  adjusted  to  reflect  the  above  multipath  fading  “channel  processing”. 
As  an  example,  the  multipath  version  of  the  joint  space-time  MF  now  takes  the  well  known  form  of  the 
RAKE  filter,  i.e.  the  “channel  processed”  space-time  effective  signature  of  the  signal  of  interest, 

w RAKB  =  -E'61{X6i}/M  =  di  ®>  ax  <g>  a1  (10.42) 

where  X  denotes  now  the  M  x  (LN)  space-time  data  matrix  created  by  sampling  at  the  chip  rate  the 
multipath  fading  received  signal  in  (10.41),  and  identifies  the  N  x  1  vector  of  the  multipath  fading 
channel  coefficients. 


10.7  Filter  Estimation  Considerations 

In  this  section  we  assume  availability  of  the  “channel-processed”  spatial,  temporal  or  space-time  code  needed 
to  implement  a  particular  receiver  configuration.  We  note,  however,  that  a  sample  average  implementation 
of  the  expectation  operation  in  (10.11),  (10.35)  or  (10.42),  for  example,  can  serve  as  a  supervised  estimator 
of  the  corresponding  channel  processed  signature.  For  instance,  for  joint-domain  processing,  wRAKE  = 
j  where  { X  j  }j  5l  is  a  sequence  of  J  space-time  received  data  vectors.  We  recognize  that 

if  sample  averaging  extends  over  multiple  multipath  channel  realizations  ( J  — »  oo)  then  ~wRAKE  converges 
either  to  the  direct  path  space-time  signature  of  the  signal  of  interest  when  Rician  fading  is  assumed  or 
to  the  0  vector  when  no  direct  path  exists,  i.e.,  Rayleigh  fading  is  assumed.  In  either  case,  in  order  to 
capture  the  prevailing  full  space-time  effective  signature,  vvRAKE  needs  to  be  continuously  reinitialized  at  a 
rate  that  is  consistent  with  the  fading  rate  or  a  corresponding  forgetting  factor  needs  to  be  embedded  in 
the  sample  average  procedure.  We  are  careful  to  note,  however,  that  estimation  of  the  channel-processed 
signature  through  supervised  or  blind  estimation  procedures  is  not  the  primary  subject  of  this  work.  What 
we  are  concerned  with  in  this  chapter  is  to  minimize  the  induced  BER  of  the  overall  system  with  a  given 
channel- processed  signature  vector  and  a  few  data  samples  XJ;  j  =  1, 2,  •  ■  •  ,  J.  Similar  considerations  hold 
for  the  disjoint  configurations. 

In  the  example  mentioned  in  the  previous  section,  the  fading  rate  for  a  mobile  GPS  receiver  moving  (along 
the  line  of  sight)  at  35  mph  is  about  70  Hz,  while  when  the  receiver  is  moving  approximately  perpendicular 
to  the  line  of  sight,  then  the  channel  varies  at  a  frequency  0.1  Hz.  The  former  case  implies  that  the  channel 
can  vary  faster  than  the  information  bit  rate,  while  for  the  latter  case  only  5ML  data  are  available.  As  a 
result,  adaptive  signal  processing  is  constrained  by  short  data  record  operational  characteristics.  This  is,  in 
fact,  the  basic  motivation  for  the  development  of  the  auxiliary-vector  GPS  processor  as  explained  below. 
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Besides  the  fixed  MF-type  structure,  all  other  filter  alternatives  discussed  in  the  previous  section  (that  is 
MVDR  or  AV  type)  are  adaptive  in  nature  and  have  been  presented/formulated  under  ideal  conditions,  that 
is,  under  the  assumption  that  the  involved  covariance  matrix  R  is  known.  Here  R  denotes  either  the  spatial  or 
temporal  or  space-time  receiver  input  covariance  matrix.  In  practice,  however,  R  is  unknown  and  it  is  sample- 
average  estimated  by  a  data  record  of  finite  size.  When  R  is  substituted  by  the  sample  average  estimate  R 
then  the  ideal  receiver  expressions  in  (10.14),  (10.20),  (10.23),  (10.27),  (10.36),  (10.40)  assume  their  estimated 
versions.  In  this  context,  the  auxiliary- vector  algorithm,  in  addition  to  its  own  merit  in  the  context  of  MS- 
optimum  signal  processing  theory,  produces  a  sequence  of  MVDR  filter  estimators  of  the  form  w(0),  w(l),  •  •  • 

.  This  sequence  was  extensively  studied  in  [27]  and  was  shown  to  offer  the  means  for  effective  control  over 
the  filter  estimator  bias  versus  covariance  trade-off.  As  a  result,  adaptive  filter  estimators  from  this  class 
have  been  seen  to  easily  outperform  in  expected  mean-square  estimation  error  (constraint)  LMS,  sample- 
matrix-inversion  (SMI)  and  RLS-type  adaptive  filter  implementations.  These  operational  characteristics 
of  the  AV  filter  estimators  place  them  favorably  in  terms  of  GPS  receiver  implementation  when  adaptive 
interference  suppression  with  short  data  records  is  the  objective.  Simulation  comparisons  in  the  following 
section  illustrate  how  the  above  observations  translate  into  superior  BER  performance. 


10.8  Numerical  and  Simulation  Comparisons 

We  consider  the  GPS  signal  model  in  (10.5)  for  a  system  with  M  =  2  antenna  elements  and  spreading  gain 
L  =1023.  In  all  cases  we  assume  that  4  satellite  signals  with  fixed  C/A  Gold  codes  are  in  view.  In  addition, 
we  assume  the  presence  of  one  or  two  high  power  spread-spectrum  jammers  (spoofers)  that  exhibit  code 
cross-correlation  with  the  signal  of  interest  ( signal  1 )  approximately  0.2  and  0.3,  respectively.  As  explained 
in  Section  10.1,  high  code  cross-correlation  models  the  jamming  effect  of  a  “GPS-looking”  hostile  signal. 
The  angles  of  arrival  of  the  satellite  signals  and  jammers  are  randomly  generated  according  to  a  uniform 
distribution  in  (— n/2,  7t/2)2. 

The  simulation/numerical  studies  in  this  section  evaluate  the  BER  performance  of  the  GPS  receiver  as 
a  function  of  either  the  signal-to-noise-ratio  (SNR)  of  the  signal  of  interest  or  the  data  record  size.  All  BER 
expressions  are  analytically  evaluated3.  The  results  for  the  one  and  two  jammer  case  are  averages  over  100 
and  500,  respectively,  independent  space-time  channels.  For  the  BER  versus  SNR  studies,  the  SNR  of  the 
signal  of  interest  varies  from  0  dB  (weak  signal  due  to  foliage  or  terrain  attenuation)  to  15  dB  (normal 
strength  signal4).  The  SNRs  of  other  satellite  signals  are  fixed  at  15  dB  while  the  jammer  SNR  is  fixed  at 
30  dB.  The  sampled  AWGN  is  assumed  to  be  i.i.d.  across  spatial  channels  (antenna  elements). 

In  Figs.  10.5-10.8  different  receiver  configurations  are  compared  in  terms  of  their  BER  performance  in 
the  absence  of  multipath  fading.  Fig.  10.5  plots  the  BER  versus  the  SNR  of  signal  1  for  the  estimated 
disjoint  S-T  cascade  and  joint  configurations  with  data  support  equal  to  8ML.  One  jammer  is  present.  We 
observe  that  the  sMF/tMV  configuration  performs  better  than  the  sMV/tMV  configuration.  This  behavior 
is  in  agreement  with  the  theoretical  studies  in  Section  10.3.3,  since  Pr{rj  >  0.25}  ps  0.86  for  this  simulation 
setup  (the  latter  probability  was  evaluated  numerically).  In  agreement  with  the  findings  in  [25]— [27] ,  AV 
designs  outperform  their  MVDR  counterparts.  For  this  study  both  the  JAV  filter  and  tAV  filter  utilize  5 
auxiliary  vectors. 

Fig.  10.6  plots  the  BER  versus  the  data  record  size  of  estimated  disjoint  S-T  configurations.  Fig.  10.7 
replicates  the  studies  of  Fig.  10.6  for  the  corresponding  disjoint  T-S  configurations.  Fig.  10.8  plots  the  BER 

2Table  10.1  in  the  appendix  presents  the  average  spatial  cross-correlation  rj  (evaluated  numerically)  between  two 
signals  with  DoA-separation  angles  in  [0,0. 27r],  (0.27t,7t],  and  [0,  tt]  when  an  antenna  array  of  M  =  2,  3,  4,  8,  or  10 
elements  is  utilized.  Uniform  DoA  distribution  (A 6  £  [0, 7r])  appears  as  a  reasonable  maximum-entropy  assumption 
that  represents/reflects  the  correlation  characteristics  of  the  jamming  environment  for  various  A 9  values. 

3  We  recall  that  the  received  vectors  are  distributed  according  to  a  Gaussian  mixture  pdf;  thus  the  BER  is  ana¬ 
lytically  evaluated  as  a  sum  of  Q(-)  functions  over  all  possible  bit  combinations  of  the  interferers  weighted  by  the 
probability  of  the  corresponding  bit  combination  (equally  probable  bit  combinations  in  our  case). 

4Under  normal  operating  conditions  the  nominal  thermal  noise  power  spectral  density  is  —205  dBW/Hz  [1],  which 
results  in  —142  dBW  noise  power  in  2  MHz  bandwidth.  Also,  the  minimum  specified  C/A  signal  power  at  the  receiver 
end  is  —160  dBW  while  the  actual  received  power  corresponding  to  a  conservative  link  budget  is  about  —157  dBW. 
The  latter  leads  to  —157  dBW—  (—142  dBW)  =  —15  dB  SNR  per  code  chip  which,  after  despreading  (that  is  30  dB 
gain),  results  in  —15  dB  +  30  dB  =  15  dB  pre-detection  SNR. 
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versus  the  data  record  size  for  the  estimated  joint-domain  configurations.  The  SNR  of  signal  1  in  Figs.  10.6- 

10.8  is  fixed  at  15  dB,  two  jammers  are  present,  the  sAV  filters  utilize  one  auxiliary  vector,  while  JAV  and 
tAV  filters  utilize  5  auxiliary  vectors. 

Fig.  10.9  shows  the  BER  performance  as  a  function  of  the  data  record  size  for  the  joint-domain  config¬ 
urations  and  a  few  selected  disjoint  configurations  in  the  presence  of  multipath  fading  (all  AV  filters  utilize 
5  auxiliary  vectors).  One  jammer  is  present  and  each  signal  experiences  N  =  3  independent  paths  that  are 
5 Tc  apart  with  angles  of  arrival  uniformly  distributed  in  (— 7t/2,  7t/2).  The  SNR  of  the  SS  jammer  is  fixed 
at  30  dB  while  the  total  SNR  of  the  GPS  signal  of  interest  (the  sum  of  the  received  SNR  over  all  paths)  is 
fixed  at  15  dB.  Following  the  notation  in  (10.41)  the  total  received  SNR  for  an  SS  signal  k,  k  =  1,  2,  ■  ■  ■  ,  K , 
is  defined  by  Ek  Yln=i  ^{\ak,n\2}/cr2,  where  a2  is  the  variance  of  the  sampled  AWGN. 

Figs.  10.5-10.9  illustrate  the  performance  gains  when  non-MF-type  signal  processing  is  performed  by 
the  GPS  receiver  and  do  not  consider  combining.  Additional  performance  gains  obtained  through  EGG 
combining  are  illustrated  in  Fig.  10.10  where  we  plot  the  BER  as  a  function  of  the  data  record  size  for 
selected  receiver  configurations  of  the  previous  figures. 

10.9  Conclusions 

Disjoint  space  and  time  domain  as  well  as  joint  space-time  signal  processing  algorithms  have  been  investigated 
for  GPS  navigation  data  demodulation  in  the  presence  of  multiple- access  satellite  interference,  intelligent 
SS  interference  highly  correlated  with  the  signal  of  interest  and  AWGN.  In  particular,  linear  GPS  filters 
were  derived  based  on  MF,  MVDR  and  AV  processing  principles.  The  relative  merits  of  different  disjoint 
domain  configurations  were  analytically  evaluated  and  summarized  in  Theorem  10.1.  Simulation  results 
illustrated  the  performance  of  GPS  filter  estimators  of  both  disjoint  and  joint  domain  configurations  under 
different  interference  conditions,  channel  conditions,  and  data  record  sizes.  It  was  demonstrated  that  joint 
domain  adaptive  processing  configurations  have  advantages  over  disjoint  domain  schemes.  However,  these 
advantages  quickly  diminish  under  limited  data  support  where  disjoint  (or  joint)  auxiliary-vector-receivers 
were  seen  to  offer  effective  SS  interference  suppression  and  superior  BER  performance.  Finally,  further  BER 
performance  improvements  under  limited  data  support  were  sought  through  output  filter  combining  methods 
that  exploited  the  redundancy  inherent  in  the  GPS  signal  due  to  the  20-times  repetition  of  the  C/A  code  in 
a  single  information  bit  period. 


10.10  Appendix 

10.10.1  Proof  of  Theorem  10.1 

Let  Ej  and  T;  =  Ei/a2  denote  the  input  energy  and  input  SNR,  respectively,  of  signal  i,  i=l,  2  (signal  1  is 
the  signal  of  interest),  where  a2  is  the  input  noise  variance.  Let  also  g  =  d^d2  and  (  =  denote  the 

temporal  and  (complex)  spatial  cross-correlation,  respectively  (we  note  that  p  =  |p|  and  rj  =  |<(j). 


Part  A  |  We  first  evaluate  the  spatial  and  temporal  filter  as  well  as  the  overall  system  output  interference- 
plus- noise  variance  PIN  as  a  function  of  Ti,  r2,  p  and  rj.  Since  all  design  alternatives  involve  filters  that 
preserve  the  energy  of  the  signal  of  interest  ( signal  1 ),  the  output  variance  PIN  is  inversely  proportional  to 
output  SINR. 
sMF/tMF 


sMF/tMVDR 


W  sMF  =  ai/M 

(10.43) 

W1HF  =  dl 

(10.44) 

PIN(sMF/tMF)  2  2.  1 

^  =T27ip+m- 

(10.45) 

MF  =  &-1 /M 

(10.46) 
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where 

sMVDR/tMVDR 


Vr2+1 


di-: 


l/r2+l-p2  V  1/r'a+l 

PIN(sMF/tMv)  T2p2p2 


[i  +  r'2(i-p2 


1 

M 


P2(l-P2) 

(l/r2  +  l-p2)2 


r'2  =  MT2r,2 


where 


1/r2  +  l  ,  n _ 

”MV  i/r"+i-772  V  1  i/r;{+i 


a2  /M 


i/r'2"+i 
i/r'2"+i-p2 

DIN(sMv/tMv) 


d,  - 


crz 


P2(1~P2) 


i/r'2"+i  2 

_ r2pV _ 

[l+r"(l-p2)]2[1+r»(l_p2)]2 


p2(i-p2) 


Mr2 

L 

MY2r,2 

(i  +  r'2')2  -r"(2  +  r")p2' 


sMVDR/tMF 

The  w.mv  and  wtMF  filters  are  given  by  (10.50)  and  (10.44)  respectively.  Then, 


Pin(smv /t.MF) 


r 2V  p 


2„2 


[1  +  T"(l  -  772)]2 


1 

M 


P2(1~P2) 

(l/r'+i-p2)2 


(10.47) 

(10.48) 

(10.49) 

(10.50) 

(10.51) 

(10.52) 

(10.53) 

(10.54) 


(10.55) 


where  r2  is  given  by  (10.53).  The  relations  ( i )  and  (ii)  follow  by  direct  comparison  of  (10.45),  (10.48), 
(10.52)  and  (10.55).  To  show  (iv)  we  observe  that  the  right  hand  side  of  (10.48)  is  equal  to 


r  2pV  i 

l  +  r'2(l  -p2)  M' 


(10.56) 


The  second  term  (noise  power)  of  the  right  hand  side  of  (10.52)  is  always  greater  than  the  second  term  of 
the  right  hand  side  of  (10.56)  and  the  sufficient  condition  can  be  derived  by  comparing  the  corresponding 
first  terms. 

Part  B  Part  B  of  Theorem  10.1  can  be  proved  in  a  similar  manner  after  deriving  the  dual  expressions  of 
(10.43)-(10.55)  by  direct  substitution  of  p2  by  p2,  p  by  £*,  ai  by  di,  a2  by  d2,  and  vice  versa. 

Part  C  Part  C  can  be  proved  by  direct  comparison  of  the  involved  expressions  (i.e.  (10.55)  and  (10.48)) 
obtained  in  Parts  A  and  B  above.  □ 


10.10.2  Average  Spatial  Cross-correlation  versus  Number  of  Antenna  Elements 

Table  10.1  below  shows  the  average  spatial  cross-correlation  fj  between  two  signals  whose  DoAs  are  A 6  apart, 
as  a  function  of  the  number  of  antenna  elements  M . 
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Fig.  10.1.  Space-time  cascade  receiver  structure. 


Fig.  10.2.  Time-space  cascade  receiver  structure. 


Fig.  10.3.  Output  SINR  comparisons  for  the  disjoint  sMF/tMVDR  and  sMVDR/tMVDR  configurations 
as  a  function  of  the  spatial  cross-correlation  p  and  the  temporal  cross-correlation  p. 


Fig.  10.4.  Joint  space-time  receiver  structure. 
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Fig.  10.5.  Bit-Error-Rate  as  a  function  of  the  SNR  of  the  signal  of  interest  for  disjoint  S-T  and 
joint  S-T  configurations  in  the  presence  of  one  jammer  (data  record  size  =8ML). 


1  2  3  4  5  6  7  8 

Data  Record  Size  (x  ML) 


Fig.  10.6.  Bit-Error-Rate  as  a  function  of  the  data  record  size  for  disjoint  S-T  configurations 
(SNRi=15  dB,  two  jammers  present). 
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Data  Record  Size  (x  ML) 

Fig.  10.7.  Bit-Error-Rate  as  a  function  of  the  data  record  size  for  disjoint  T-S  configurations 
(SNRi=15  dB,  two  jammers  present). 


1  2  3  4  5  6  7  8 

Data  Record  Size  (x  ML) 


Fig.  10.8.  Bit-Error-Rate  as  a  function  of  the  data  record  size  for  joint  space-time  configurations 
(SNRi=15  dB,  two  jammers  present). 
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Fig.  10.9.  Bit-Error-Rate  as  a  function  of  the  data  record  size  under  multipath  fading  (total 
SNRi=15  dB,  one  jammer  present). 


Fig.  10.10.  Bit-Error-Rate  as  a  function  of  the  data  record  size  after  EGC  combining  (SNRi=15 
dB,  two  jammers  present). 
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Table  10.1.  77  vs.  M 


M 

2 

3 

4 

5 

8 

10 

0  <  A 0  <  0.2tt 

0.91 

0.78 

0.67 

0.59 

0.44 

0.38 

AO  =  0.2vr 

0.70 

0.35 

0.26 

0.23 

0.13 

0.09 

0.27 r  <  AO  <  7 r 

0.54 

0.37 

0.30 

0.25 

0.17 

0.14 

0  <  A#  <  7T 

0.67 

0.52 

0.44 

0.38 

0.27 

0.23 
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Chapter  11 


Low  Complexity  Decoding  of  Finite 
Geometry  LDPC  Codes 

11.1  Introduction 

Low-density  parity-check  (LDPC)  codes,  originally  introduced  by  Gallager  [1,  2]  and  brought  into  prominence 
by  MacKay  and  Neal  [3,  4],  are  currently  attracting  a  great  deal  of  research  interest.  Indeed,  practical 
decoding  algorithms  operating  on  certain  long  LDPC  codes  were  seen  to  offer  exceptional  error  protection. 
On  the  other  hand,  although  various  code  design  techniques  are  available,  most  of  the  “good”  LDPC  codes 
known  so  far  are  generated  randomly  by  computer  and  are  lacking  a  certain  algebraic  structure.  This  impairs 
our  ability  to  carry  out  theoretical  analysis  or  develop  cost  effective  encoder  implementations. 

A  notable  exception  to  the  above  is  the  algebraically  constructed  finite  geometry  LDPC  (FG-LDPC) 
codes  [5].  Most  of  the  FG-LDPC  codes  are  cyclic  or  quasi-cyclic;  therefore,  their  encoding  can  be  easily 
implemented  by  linear  shift  registers  with  feedback  connections.  Compared  to  the  computer  generated 
LDPC  codes  with  the  same  code  length  and  dimension,  FG-LDPC  codes  have  a  more  balanced  structure 
and  better  performance,  especially  in  the  range  of  short  to  moderate  code  lengths  (102  —  104).  The  parity- 
check  matrices  of  FG-LDPC  codes  employ  some  redundant  checks  (the  number  of  rows  of  the  parity-check 
matrix  is  greater  than  the  difference  between  the  code  length  and  the  code  dimension)  and  row/column 
weights  are  also  somewhat  higher  than  usual  LDPC  codes.  This  implies  higher  decoding  complexity  than 
the  computer  generated  LDPC  codes,  which  may  be  the  price  for  the  improved  performance. 

In  this  chapter,  we  propose  a  new  decoding  algorithm  for  high-rate  FG-LDPC  codes.  LDPC  codes  draw 
their  error  correction  strength  from  their  relatively  long  length  in  comparison  with  conventional  linear  block 
codes.  Undetected  errors  at  the  decoder  are  exceedingly  rare.  The  codewords  are  distributed  sparsely  in  the 
TV-dimensional  space  over  GF{ 2)  and  in  almost  all  cases  we  either  find  the  correct  codeword  or  cannot  find 
any  codeword  at  all.  In  other  words,  when  we  search  around  the  received  vector  -according  to  a  properly 
defined  distance  metric-  we  expect  that  the  first  valid  codeword  to  encounter  will  be  the  correct  one  with 
exceedingly  high  probability.  In  this  work,  we  begin  indeed  with  the  hard-decision  received  vector  of  length  TV 
and  we  search  around  it  iteratively.  In  each  iteration,  we  flip  only  one  bit  of  the  length- TV  vector.  The  bit  to 
be  flipped  is  chosen  according  to  a  selection  metric  that  is  supposed  to  conform  well  with  the  characteristics  of 
FG-LDPC  codes  (many  cycles  of  length  six,  redundant  checks  and  somewhat  higher  than  usual  row/column 
weights) .  The  metric  combines  the  number  of  failed  checks  that  bits  participate  in  and  their  reliability.  After 
each  iteration,  the  metric  values  are  updated  as  needed.  Calculation  of  the  bit  selection  metric  requires  no 
knowledge  of  the  signal  energy  or  noise  power  characteristics  of  additive  white  Gaussian  noise  (AWGN) 
channels.  If  a  valid  codeword  is  found,  the  search  is  terminated  and  successful  decoding  is  claimed.  If  no 
codeword  is  found  after  a  certain  number  of  iterations  (search  upper  bound),  then  we  declare  a  decoding 
failure.  Last  but  not  least,  during  decoding  we  can  record  a  window  of  searched  vectors  along  the  search 
path  to  identify  and  break  closed  loops.  This  is  an  optional  modification  that  can  improve  the  decoding 
performance  without  much  complexity  increase. 
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11.2  The  Decoding  Algorithm 


11.2.1  Notation  and  Basic  Definitions 


Consider  a  binary  (TV,  A')  LDPC  code  with  length  TV  and  dimension  K.  We  choose  an  arbitrary  codeword 
c  =  [ci,  C2,  ■  •  ■  ,  cjv].  The  codeword  is  transmitted  over  a  symmetric  binary-input  memoryless  discrete-time 
channel  and  the  received  vector  is  denoted  by  r  =  , rjv].  Here,  by  “symmetric  channel”  we  mean 

that  f(r\c  =  1)  =  /(— r|c  =  0)  where  /(•(■)  is  the  conditional  probability  distribution  function  of  the  channel 
output.  We  define  the  log-likelihood  value  for  each  channel  output  rn ,  n  =  1,  2,  . . . ,  TV, 


A 

2/n  = 


log 


/MCn 

/(^n|c„ 


1) 

0) 


(11.1) 


and  the  initial  hard-limited  bit  decision 

^0)=sgn  {Vn)  (11-2) 

where  sgn(y)  =  1  if  y  >  0  and  sgn(y)  =  0  if  y  <  0.  The  absolute  value  of  yn  is  called  the  reliability  of 
the  initial  decision  bn  \  The  vectors  [|j/i|,  1 2/2 1 » ■  •  *  j  |l/Jv|]  and  b(0)  =  [ b b 2°\  •  •  •  ,  b$]  are  the  inputs  to  the 
decoder. 

A  binary  LDPC  code  is  completely  described  by  its  sparse  binary  parity-check  matrix  H.  For  an  (TV,  K ) 
LDPC  code,  H  has  TV  columns  and  M  >  TV  —  K  rows.  For  any  tentative  bit  decision  vector  b,  the  set  of 
check  sums  -or  syndrome-  is  the  vector  s  =  bHT.  For  a  regular  LDPC  code,  H  has  a  constant  column  weight 
tc  and  a  constant  row  weight  tr.  Following  exactly  the  notation  of  [4],  we  define  the  set  A f(m)  that  contains 
the  indices  of  the  tr  bits  that  participate  in  check  m  and  the  set  A4(n)  that  contains  the  indices  of  the  tc 
checks  in  which  bit  n  participates: 


Af(m)  =  {n  :  Hm„  =  1}  ,  M(n)  =  {m  :  Hmn  =  1}  . 


(11.3) 


For  each  check  1  <  m  <  M,  we  define  the  “lower  check  reliabilityv  value  lm  and  the  “upper  check  reliability’’’ 


value  um  as  follows: 


A 


min 

n£j/(m) 


\Vn\, 


um=  max  \yn\. 

n£M(m) 


(11.4) 


11.2.2  Algorithm 

The  proposed  algorithm  starts  from  the  hard- limited  received  data  vector  b(0)  and  searches  iteratively  the 
TV-dimensional  vector  space  over  GF{ 2)  for  a  valid  codeword  vector.  Assume  that  prior  to  the  A:th  iteration, 
k  >  1,  the  bit  vector  under  consideration  is  b^’-1)  with  corresponding  syndrome  vector  s(fc_1)  =  b^’_1)HT. 
The  objective  is  to  perturb  b^’-1)  by  one  bit  and  create  a  new  candidate  bit  vector  b(fc).  To  choose  which 
bit  of  b(fc_1)  to  flip  we  define  for  each  bit  n  =  1,  2,  . . . ,  TV  the  cumulative  metric  over  all  checks  in  A4(n) 


4fc)=  E  1  <  n  <  TV, 

mdJvMji) 

where  for  each  check  m  £  A4(n) 

,(fc)  a  (  \yn  \  —  lm/2,  if  Sm_1)  =  0 

1  \yn\-  (am  +  W2),  if  Sm_1)  =  l-1 


Then,  we  flip  the  bit  in  b^’  with 


minimum 


ej)n'1  value  to  obtain  b^: 


n ^  =  arg  min  <f> ^  and  b(fci  =  b(fc  ©  e(n<  ^ 

l<n<N 


(11.5) 


(11.6) 


(11.7) 


where  ©  denotes  addition  over  GF(2)  and  e(n<  ^  is  the  TV-dimensional  unit  vector  with  1  at  the  n*-fcHh 
position  and  0  everywhere  else.  We  update  the  syndrome  vector  to  s<k')  from  the  previous  syndrome 
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(b(fc)  and  b^fc  1  -*  differ  only  in  one  bit)  and  compare  with  the  all-zero  vector.  If  s ^  =  0,  then  we  have  a 
valid  codeword  and  we  stop  the  search.  If  not,  we  calculate  (jyn  ,  n  =  1,  2,  . . . ,  TV,  and  continue. 

The  basic  properties  of  the  selection  metric  in  (11.5),  (11.6)  are  as  follows,  (i)  If  bit  n  passes  check  m  G 
M  (n) ,  then  G  [Zm/2,  um  —  Zm/2].  (it)  If  bit  n  fails  check  m  G  M  (n),  then  G  [— (um  —  Zm/2),  —  Zm/2]. 
(Hi)  Among  all  bits  that  participate  in  check  m,  the  selection  rule  favors  the  least  reliable  ((11.6)  incorporates 
yn  as  a  fixed  term),  (iv)  Over  all  bits  and  all  checks,  the  selection  rule  favors  bits  that  participate  in  failed 
checks  (because  4>n,m  >  0  if  sm  =  0  and  4>n,m  <  0  if  sm  =  1).  In  essence,  the  selection  rule  combines  “the 
number  of  failed  checks”  and  “the  reliability”  into  a  new  criterion  function  (f>n.  The  criterion  function  cj>n 
maintains  linearity  with  respect  to  \yn\,  therefore  for  AWGN  channels  we  can  replace  \yn\  by  \rn\  and  still 
have  an  equivalent  decoder  that  requires  no  knowledge  of  signal  energy/noise  power. 


11.3  Improvement:  Loop  Detection 


In  the  algorithm  that  we  described  in  the  previous  section,  the  selection  of  the  bit  to  flip  depends  only  on 
the  current  bit  vector  b(fc_1)  and  the  reliability  vector  [|yi|,  •  •  •  ,  |j/jvj]: 

n(k)  =  argmin</>n(b(fc_1),[|yi|,---  ,\yN\])-  (11-8) 

n 


If  in  the  fcth  iteration  the  new  bit  vector  b ^  coincides  with  a  previously  considered  vector  b(fc°\  ko  <  k, 
then  n^k+1^  =  n(fco+1)  and  b^’+b  =  b(fc°+b.  By  induction,  b^fc+p)  =  b(fco+b  for  any  p  >  0  and  b^)  is  seen 
to  be  periodic.  Since  there  is  no  valid  codeword  vector  in  {b*fco\ b(fc°+1),  •  •  •  ,  b^fc'  '*)},  we  will  not  find  a 
valid  codeword  vector  if  we  continue  the  search.  The  decoding  process  is  trapped  into  an  infinite  loop  and 
a  decoding  failure  will  be  reported  when  the  maximum  allowable  iteration  number  is  reached. 

To  detect  and  break  such  infinite  loops  when  they  appear,  we  can  record  all  searched  vectors;  if  in  the 
A:th  iteration  we  attempt  to  generate  a  previously  searched  vector  b(fc°\  ko  <  k,  we  will  discard  this  bit-flip 
choice  and  select  instead  the  bit  immediately  next  in  (f)n  value.  At  first  sight,  it  appears  that  to  record 
and  compare  with  all  searched  vectors  has  complexity  proportional  to  the  code  length  N  (which  may  be 
prohibitive  for  long  LDPC  codes).  The  cost,  however,  can  be  greatly  reduced  as  we  see  in  the  sequel. 

For  convenience,  let  us  first  define  the  vector  sum  E(x)  =  Y^i=x+ 1  e<'"  ^  •  From  (11.7)  and  by  induction 

we  have  b(fc)  =  b(fc°)  ©5^=*.0+1  =  b*fc°)  ©E(fco)  for  any  0  <  ko  <  k.  Naturally,  we  reach  the  important 

conclusion  that 

b(fc)  =  b(fco)  if  and  only  if  E (k0)  =  0.  (11.9) 

This  means  that,  to  detect  and  prevent  infinite  loops,  we  need  only  the  vectors  E (k  —  1),  E (k  —  2),  •  •  • ,  E(0). 
If  any  of  these  vectors  is  zero,  an  infinite  loop  is  detected  and  the  bit  selection  n ^  should  be  discarded.  We 
notice  also  that  since  E (k  —  1),  •  •  •  , E(0)  depend  only  on  ■  ■  ■  , there  is  no  need  to  record 

the  searched  bit  vectors  b^fc_1\  •  •  • ,  b(0)  (which  otherwise  would  impose  a  significant  storage  requirement 
for  long  codes). 

In  more  detail,  we  can  calculate  E (l)  iteratively: 


E (l  ~  1) 


E(Z)  ©  © 


(A!)) 


if  l  =  k 
otherwise. 


(11.10) 


To  compare  E(/),  l  =  0, 1,  •  •  •  ,  A:  —  1,  with  the  all-zero  vector,  we  denote  its  Hamming  weight  by  w(l).  Then, 
w(l  —  1)  can  be  easily  derived  from  w(l)  without  counting  the  number  of  l’s  in  E (l): 


w(l  —  1) 


1,  if  l  =  k 

w(l)  —  1,  if  n^)th  bit  of  E(/)  =  1 
w(l)  +  1,  if  n^)th  bit  of  E(/)  =  0. 


(11.11) 


Of  course,  E (l)  =  0  if  and  only  if  w(l)  =  0. 

The  proposed  loop  detection  procedure  does  not  depend  on  our  specific  criterion  function  <j>n  and  can  be 
used  in  conjunction  with  other  bit-flipping-type  decoding  algorithms  as  well. 

1lm/ 2  and  um+lm/ 2,  m  =  1,  2,  . . . ,  M,  need  to  be  calculated  only  once  before  the  first  iteration  of  the  algorithm. 
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Table  11.1.  Decoding  Complexity  per  Iteration  (Unit  One  Real  Operation) 


Algorithm 

Multiplications 

Divisions 

Additions 

BP  algorithm 

llTVtc  -  6(TV  +  M) 

T V(ic  +  1) 

TV(3TC  +  1) 

Normalized  BP-based  [7] 

0 

Ntc 

N(4tc  —  3)  +  TU(  riog2  tr~\  -2) 

Normalized  APP-based  [8] 

0 

Ntc 

2TVtc  +  TU(["log2  tr]  —2) 

UMP  BP-based  (min-sum)  [6,  9] 

0 

0 

TV(4ic  —  3)  +  M[log2  tr]  -2) 

Weighted  BF  [5] 

0 

0 

N  —  1  -f-  tctr 

Proposed  algorithm 

0 

0 

N  —  1  -f-  tctr 

In  summary,  the  complete  decoding  algorithm  of  Section  11.2  that  incorporates  loop  detection  and 
prevention  is  as  follows: 

(i)  Initialization:  Iteration  counter  k  <—  0;  “Exclusion  list”  B  G-  0;  lm/2,  um  +  Zm/2,  m  =  1,  2,  . . . ,  M, 
calculated  by  (11.4);  b(0)  calculated  by  (11.2). 

(ii)  Calculate  syndrome  if  s^’)  =  0,  then  return  b^’)  and  STOP. 

(iii)  k  <—  k  +  1;  if  k  >  A:max  (user  specified  maximum  iteration  number),  then  declare  decoding  failure  and 
STOP. 

(iv)  Calculate  (j)n'\  n  =  1,  2,  . . . ,  TV,  by  (11.5),  (11.6). 

(v)  G-  argmin„^B  (f>n\ 

(vi)  Calculate  E(Z),  w(l)  by  (11.10),  ( 11.11),  l  s?  k  —  1,  k  —  2,  •  ■  ■  ,  0;  if  w(l )  =  0  for  any  l  =  k  —  1,  •  •  •  ,0,  then 
B  <—  B  U  { n (O  }  and  go  back  to  Step  5. 

(vii)  Calculate  b^’)  by  (11.7);  B  G-  0;  go  to  Step  2. 


11.4  Computational  Complexity 


The  computational  complexity  of  the  decoding  algorithm  described  above  can  be  calculated  as  follows. 
During  each  iteration,  one  bit  of  the  TV-dimensional  bit  vector  b  is  flipped.  To  select  this  bit  according  to 
the  metric  <j)n ,  TV  —  1  real  comparisons  are  needed.  After  bit  n  is  selected  and  flipped,  tc  syndrome  bits  that 
correspond  to  the  tc  checks  in  which  bit  n  participated  should  be  updated  (flipped): 


Sm  1  -  3m,  m  G  M(n). 


(11.12) 


Then,  for  each  m  G  M(n),  we  update  <j>n  for  all  n  G  A f(m): 


4>n  t— 


{ 


4>n  T 

0n  Urm 


if  Sm  =  0 
if  Sr,,  -  1  ’ 


n  G  A f(m). 


(11.13) 


Since  comparisons  can  be  considered  as  additions,  we  need  altogether  TV  —  1  +  |A/]|.M|  =  TV  —  1  +  tctr  real 
additions  per  iteration.  Both  the  comparison  and  the  update  of  (f>n  can  be  parallelized  and  accomplished  in 
[log2TV]  and  tr  time  units,  respectively. 

Loop  detection,  as  described  in  Section  11.3,  requires  only  bit  operations  and  integer  counter  incre¬ 
ments/decrements.  Most  importantly,  the  total  number  of  loop  detection  operations  is  independent  of  the 
block  length  TV  and  depends  only  on  the  total  number  of  executed  iterations.  Therefore  the  cost  of  loop 
detection  in  comparison  with  the  TV  —  1  +  tctr  real  additions  required  by  the  core  decoding  procedure  can 
be  safely  ignored. 

In  Table  11.1,  we  summarize  the  decoding  complexity  in  real  number  operations  per  iteration  for  several 
algorithms:  Belief  propagation  (BP),  three  simplified  versions  of  BP,  namely  the  “normalized  BP-based”  [7], 
“normalized  APP-based”  [8],  and  “uniformly  most  powerful  (UMP)  BP-based”  [6]  algorithms,  the  “weighted 
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bit-flipping  (BF)”  algorithm  [5],  and,  as  a  last  entry,  our  proposed  algorithm.  The  UMP  BP-based  algorithm 
coincides  with  the  “min-sum”  algorithm  [9,  Algorithm  B]  although  their  derivations  are  different.  UMP  BP- 
based  (min-sum),  weighted  BF,  and  the  proposed  algorithm  all  require  no  multiplication/division  operations 
at  all.  It  is  also  important  to  note  that  for  certain  LDPC  codes  the  normalized  BP-based  and  the  normalized 
APP-based  divisions  may  be  with  respect  to  a  power  of  2,  in  which  case  they  degenerate  to  trivial  binary 
shift  operations  [7,  8]. 

Analytic  statistical  evaluation  of  the  total  number  of  iterations  that  are  required  per  codeword  is  not 
an  easy  task  for  either  of  the  above  algorithms.  For  the  proposed  algorithm  in  particular,  since  only  one 
bit  is  flipped  in  each  iteration,  we  may  suggest  intuitively  and  validate  experimentally  a  statistical  model 
under  which  the  average  number  of  iterations  is  proportional  to  the  mean  number  of  bits  in  error  in  the 
hard-decision  received  vector.  The  mean  number  of  iterations  for  the  proposed  algorithm  is  expected  to  be 
higher  than  BP;  however,  since  the  operations  per  iteration  are  much  fewer,  we  can  count  on  significant 
complexity  savings  even  compared  with  simplified  versions  of  BP.  We  should  also  point  out  that  paral¬ 
lel  implementations  of  BP  decoders  still  need  approximately  10fc  +  5 tr  time  units  per  iteration  assuming 
availability  of  max{M,  N}  processors.  With  |dog2-/V]  +  tr  times  units  per  iteration  and  N/2  processors, 
our  proposed  algorithm  parallelized  would  maintain  speed  advantage  for  reasonably  long  codes  at  half  the 
hardware  complexity. 


11.5  Simulation  Results 

In  this  section,  we  evaluate  the  BER  of  various  LDPC  decoders  as  a  function  of  the  signal-to-noise  ratio 
(SNR).  To  allow  performance  comparisons  across  different  rate  codes  we  use  the  normalized  SNR  per  in¬ 
formation  bit  We  study  the  following  decoders:  (i)  BP,  (ii)  UMP  BP-based  [6]  (min-sum), 

(Hi)  normalized  BP-based  [7],  (iv)  normalized  APP-based  [8],  (v)  Gallager’s  original  bit-flipping  (BF)  al¬ 
gorithm  [2],  (vi)  weighted  BF  introduced  in  [5],  and  (vii)  the  algorithm  proposed  herein.  The  maximum 
iteration  number  is  set  equal  to  200  for  all  algorithms.  In  Fig.  11.1,  we  present  our  findings  for  the  Type- 
I  2-D  (1023,  781)  EG-LDPC  code  with  rate  0.763  [5].  The  normalization  factor  for  the  two  normalized 
decoders  is  set  equal  to  4.0  based  on  the  analysis  in  [7].  We  see  that  the  proposed  algorithm,  with  loop 
detection  and  prevention  as  described  in  Section  11.3,  offers  an  arguably  substantial  gain  of  0.48dB  at  BER 
of  10-5  over  the  similar  complexity  weighted  BF  algorithm  (the  gain  over  UMP  BP-based  and  plain  BF  is 
0.57dB  and  0.97dB,  respectively).  At  the  same  time,  the  proposed  algorithm  is  just  about  0.63dB  away  from 
straight  BP  decoding,  0.61dB  from  normalized  APP-based  decoding,  and  0.56dB  from  normalized  BP-based 
decoding.  With  these  performance  studies  in  mind,  we  move  on  to  algorithmic  complexity  comparisons.  We 
plot  the  mean  number  of  real  operations  needed2  (real  operations  per  iteration  x  mean  iteration  number) 
for  each  algorithm  versus  yb/TVo  (Fig.  11.2).  In  view  of  the  results  in  Figs.  11.1  and  11.2,  we  argue  that 
the  proposed  algorithm  and  the  normalized  APP-based  algorithm  of  [7]  offer  the  two  best  trade-off  points 
between  performance  and  computational  cost  (a  “low-end”  and  a  “high-end”  trade-off  point,  respectively). 
As  a  form  of  visual  aid  that  may  help  us  gain  a  better  understanding  of  the  underlying  algorithmic  behavior, 
in  Fig.  11.3  we  plot  the  empirical  “distribution”  of  the  number  of  iterations  needed  by  each  algorithm  to 
decode  one  received  vector.  We  fix  jb/No  at  3.85dB  and  we  run  each  algorithm  800,000  times.  The  x-axis 
represents  the  iterations  needed  to  decode  a  received  vector  and  the  y-axis  represents  the  density  of  appear¬ 
ance  of  an  iteration  number  as  observed  in  the  simulation.  For  effective  visualization,  we  use  a  log  scale 
for  the  y-axis.  We  see  that  for  the  BP,  UMP  BP-based,  normalized  BP-based  and  APP-based  algorithms, 
most  successful  decodings  terminate  during  the  first  10  to  15  iterations  at  the  price  of  high  complexity  per 
iteration.  Weighted  BF  and  the  proposed  algorithm  with  or  without  loop  detection  have  frequency  of  occur¬ 
rence  peaks  at  about  25  iterations.  The  algorithm  without  loop  detection  has  almost  all  successful  decodings 
concentrated  in  the  first  65  or  so  iterations.  The  remaining  decodings  fail  as  seen  by  the  spike  at  iteration 
200.  Apparently,  a  good  number  of  decoding  failures  can  be  attributed  to  infinite  loop  traps  as  seen  by  the 
operation  of  the  version  of  the  algorithm  that  incorporates  loop  detection  and  prevention  (bottom  entry  in 
Fig.  11.3). 

To  verify  that  our  findings  and  conclusions  continue  to  hold  true  for  much  longer  FG-LDPC  codes  than 

2In  Figs.  11.2  and  11.5,  the  operations  for  the  BP  algorithm  are  multiplications+divisions-l-additions.  For  all  other 
algorithms,  the  operations  are  additions  only  (multiplications/divisions  are  not  required  at  all). 
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Fig.  11.1.  Bit-error-rate  versus  rate  normalized  SNR  for  the  Type-I  2-D  (1023,  781)  EG-LDPC 
code. 


the  (1023,  781)  code  of  Figs.  11.1-11.3,  in  Fig.  11.4  we  consider  the  Type-II  3-D  (4599,  4227)  EG-LDPC  code 
with  rate  0.919  (the  normalization  factor  is  4.0  again).  At  BER  of  10-5,  the  proposed  low  cost  algorithm 
is  again  about  0.6dB  away  from  BP,  less  than  0.4dB  from  normalized  BP-based  decoding,  and  only  1.7dB 
away  from  the  Shannon  limit3.  Compared  to  weighted  BF,  the  algorithm  offers  a  performance  improvement 
of  about  0.8dB.  Fig.  11.5  establishes  the  proposed  algorithm  as  the  lowest  complexity  scheme  among  all  BP, 
BP-based,  APP-based,  and  weighted  BF  decoders. 


3The  minimum  distance  of  this  code  is  only  9.  In  our  study  of  Fig.  11.4,  we  observed  that,  except  for  the  normalized 
BP-based  decoder,  all  other  decoders,  including  BP,  generated  a  considerable  percentage  of  undetected  errors. 
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Fig.  11.2.  Real  operations  needed  versus  rate  normalized  SNR  for  the  code  of  Fig.  11.1. 
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Fig.  11.3.  Frequency  of  occurrence  of  executed  iterations  for  the  code  studies  in  Figs.  11.1  and  11.2 
(7fcM)=3.85dB). 
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